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1. INTRODUCTION 



One of the primary goals of theoretical physics is to provide the simplest models cap- 
turing various complex physical phenomena. In condensed matter physics, as well as in 
statistical mechanics more broadly, spin systems often serve in this role. As the Ising 
model and other idealized classical spin models have been invaluable in forming our un- 
derstanding of thermal phase transitions and critical phenomena, so are various quantum 
spin models now instrumental in developing a theoretical framework for exotic quantum 
many-body states and quantum phase transitions, i.e., phase transitions driven by quan- 
tum fluctuations (controlled by some tunable interaction parameter) at temperature T = 
[1,2]. With their many possible ordered and disordered ground states and different types 
of excitations arising from them, quantum spin systems also provide rich opportunities 
to study other manifestations of collective quantum behavior [3,4]. 

Although often used as simplified prototypical model systems for various phenom- 
ena, and not always intended for describing fully all the details of specific real materials, 
quantum spin systems have also been very successful in explaining quantitatively the an- 
tiferromagnetic properties of a variety of Mott insulators with localized electronic spins. 
A prime example of this is provided by the undoped parent compounds of the high- 
temperature superconductors and other related quasi-two-dimensional and quasi-one- 
dimensional copper oxides. Their experimentally measured magnetic response func- 
tions can be remarkably well reproduced by the S = 1/2 Heisenberg model on two- 
dimensional (2D) planes [5, 6], isolated chains [7, 8], and "ladders" [9] consisting of 
two or more coupled chains. In agreement with model calculations, 2D layered systems 
exhibit an exponentially divergent correlation length as T is lowered (until ordering sets 
in below some critical temperature due to 3D couplings or anisotropics), while chain 
and ladder compounds exhibit only short-range (power-law or exponentially decaying) 
correlations. In addition to cuprates, many other inorganic and organic antiferromagnets 
also show similarly good agreement between theory and experiments [10]. 

A prominent research theme in contemporary condensed matter physics is to model 
and explain magnetically disordered ground states of 2D or quasi-2D materials with 
non-uniform or frustrated (competing) antiferromagnetic interactions [11, 12, 13, 14]. 
Quantum phase transitions in 2D spin systems challenge the classical Ginzburg-Landau 
framework [15] for understanding and classifying phase transitions based on order 
parameters, as exemplified by the recent theory of "deconfined" quantum critical points 
[2, 16], which separate antiferromagnetic (Neel) and non-magnetic valence-bond solid 
(VBS) ground states [17]. In a field-theory proposed to describe this quantum phase 
transition (the non-compact CP^ model), deconfined spinons (collective S = 1/2 degrees 
of freedom) are the "elementary particles", out of which the two order parameters can 
be formed due to condensation (in the Neel state) or confinement (in a VBS state) [16]. 
Apart from the interest in such unusual phase transitions in condensed matter physics, 
there are also intriguing connections to deconfinement in gauge theories in particle 
physics [18]. Interacting quantum spins have also recently become interesting in the 
context of ultra-cold atoms in optical lattices [19, 20], as well as in quantum information 
theory [21]. Fundamental many-body concepts such as entanglement entropy [22] are 
currently explored in various ground states of quantum spin systems [23]. 

Exact solutions of quantum spin systems are very rare beyond one dimension, where 
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there are several important cases (enough to fill a whole encyclopedia [24], in fact). In 
two dimensions there are also some examples [25, 26], but normally analytical calcula- 
tions rely on approximations or assumptions that cannot be rigorously justified. Purely 
computational studies of model hamiltonians are therefore also essential. Unbiased nu- 
merical results are important for testing theories and analytical calculations (in particu- 
lar, continuum field theories for the low-energy physics). Moreover, numerical "simula- 
tions" can also in their own right serve as laboratories for exploration and discovery, and 
may thus stimulate further theoretical and experimental developments. 

In classical statistical physics, almost any model can be studied in detail using Monte 
Carlo or molecular dynamics simulations (although there are also challenging classical 
systems, e.g., ones with very slow, "glassy" dynamics [27]). The situation is different 
in quantum mechanics. There are still enormous hurdles limiting computational studies 
of generic quantum spin hamiltonians, especially ones with frustrated interactions and, 
going beyond pure spin models, strongly correlated fermion systems. Devising efficient 
practically useful numerical algorithms for these types of systems is one of the greatest 
challenges in theoretical physics. Thanks to a series of significant developments over the 
past couple of decades, large-scale computational studies have already become possible 
for some important classes of quantum lattice models. Very large ID systems can be 
studied using the density matrix renormalization group (DMRG) method [28, 29] or 
related methods formulated using matrix-product states [21, 30]. Quantum Monte Carlo 
(QMC) methods with loop-cluster updates [31, 32, 33] can be used to study a wide 
range of spin and boson models in any number of dimensions, typically on lattices with 
up 10^ sites or more in the ground state, and much larger still at elevated temperatures. 
In addition to breakthroughs in efficient algorithms, the impressive improvements in 
computer performance have of course played an important role in recent progress, too. 
The most dramatic gains have, however, been achieved as a result of better algorithms, 
and there is reason to believe that this will continue to be the case in the future as well. 

Topics. Two classes of computational methods will be discussed in these lecture 
notes: Numerical (exact) diagonalization and quantum Monte Carlo simulation. Exact 
diagonalization methods will be developed primarily for ID systems, followed by some 
discussion of extensions to 2D square-lattice systems. The use of symmetries for block- 
diagonalization will be developed and used in both complete diagonalization {T > 
calculations) and with the Lanczos method (for obtaining the ground state and low- 
energy excitations in a given symmetry sector). QMC simulations based on the series 
expansion of the partition function (stochastic series expansion; SSE) will be developed 
for r > calculations (and applied also in the limit T ^ 0). Computer programs written 
in close correspondence with the pseudocodes are available on-line [34]. 

Beyond describing the technical aspects of the numerical methods, an integral goal of 
these lecture notes is to introduce the most essential quantum spin models (hamiltonians) 
and to present some of their physical properties from a computational perspective. While 
the discussion is largely self-contained as far as the algorithms and implementations are 
concerned, the physics of the systems is for the most part discussed in a rather "light" 
fashion, in the form of elementary calculations (e.g., spin-wave theory) and qualitative 
descriptions illustrated by numerical results. Connections to complementary analytical 
approaches (e.g., predictions based on field theories) are also pointed out, with key 
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references for further study. The topics range in maturity from well established basics to 
very recent and ongoing research on exotic quantum phase transitions. 

Outline. The quantum spin models to be discussed in the subsequent sections are 
first introduced in Sec. 2, along with a brief summary of various types of ground states 
and quantum phase transitions. Classical phase transitions, Monte Carlo simulations, and 
finite-size scaling techniques are reviewed Sec. 3 in order to set the stage for quantum- 
mechanical finite-lattice calculations and data analysis. Exact diagonalization techniques 
and their applications to ID spin systems are discussed in Sec. 4. Basic properties of the 
Heisenberg chain and its extension with frustrated interactions are illustrated with nu- 
merical results (including the frustration-driven quantum phase transition into a dimer- 
ized VBS state). Extensions of the methods to 2D systems are also summarized, and 
used to study the low-energy states (quantum-rotor states) of small antiferromagnetic 
systems. Sec. 5 begins with a general discussion of path integrals, followed by the al- 
ternative series-expansion formulation of quantum statistical mechanics, on which the 
SSE QMC method is based. The SSE method is then developed in detail for the 5=1/2 
Heisenberg model. Illustrative results for chains, ladders, and 2D planes are presented, 
including a study of quantum-criticality in dimerized 2D systems. Applications of the 
SSE method to "J-Q" models with four- and six-spin interactions are also discussed, and 
the Neel-VBS transitions occurring in these systems as a function of the strength of the 
multi-spin iteractions are studied. Sec. 6 concludes with a brief survey of other recent 
works related to the topics of the lecture notes. 

2. QUANTUM SPIN MODELS, THEIR GROUND STATES AND 
QUANTUM PHASE TRANSITIONS 

In solid-state physics, quantum spin hamiltonians describe the effective magnetic inter- 
actions between localized electronic spins. As such, they can be derived starting from the 
full problem of interacting electrons [35, 36, 37]. Here we will not discuss their relation- 
ships with real materials in detail, but just introduce some well established models with 
Heisenberg couplings and other, similar interactions. To motivate and prepare for the 
quantitative numerical calculations in the later sections, in this introductory part we will 
first survey some of the possible types of ground states and quantum phase transitions. 
We will discuss the nature of the quantum fluctuations in spin systems from different 
perspectives, including spin wave theory and singlet pairing (valence bonds). 

The Heisenberg exchange is the most important spin-spin interaction and forms the 
starting point for understanding many materials and phenomena in quantum magnetism. 
Two spins are coupled according to the hamiltonian 

Hij = JijSi ■ Sj = JijiS^S^j + Sysy + 5^5^. (1) 

Often this pair interaction is summed over only nearest-neighbor sites (z, j), but longer- 
range interactions can also be included. The type of ground state, the nature of the ex- 
citations, and the finite-temperature properties of a system with Heisenberg interactions 
depend strongly on the underlying lattice. The dimensionality plays a crucial role. Ac- 
cording to the Mermin-Wagner-Hohenberg theorem [38, 39], a continuous symmetry 
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of a quantum system with short-range interactions, here the global SU(2) spin rotation 
symmetry, can be broken neither at T > in one dimension nor at T > in two dimen- 
sions. This normally rules out magnetic order in ID Heisenberg models (unless there 
are long-range interactions, in which case the theorem does not apply — we will discuss 
an example of this in Sec. 4.3.3), but in two dimensions the ground state can be mag- 
netic, i.e., (S() ^ (with all these vector expectation values parallel in a ferromagnet 
and alternating between two opposite directions in a bipartite antiferromagnet). Beyond 
dimensionality, the microscopic details of the lattice and the coupling strengths Jij (e.g., 
uniform, modulated in some periodic pattern, or in some random, disordered way) are 
also decisive, and many different types of ground states and quantum phase transitions 
can be realized. Some of these states and transitions are still not very well understood 
and subjects of ongoing research. 

Apart from a brief review of spin wave theory for general 5, we will in these lecture 
notes focus on the simplest case of 5 = 1/2 spins, corresponding to individual uncom- 
pensated electronic spins. This is often the most interesting case, as 5 — > oo is the classi- 
cal limit, and S = 1/2, thus, maximizes the effects of quantum fluctuations (although in 
some cases, S = 1 or higher can actually lead to even larger quantum effects, e.g., in the 
case of the "Haldane state" of the 5=1 chain [26]). We will only consider antiferromag- 
netic interactions, Jij > in (1), which from a theoretical perspective are more interest- 
ing than ferromagnetic couplings. Antiferromagnetic interactions in strongly-correlated 
systems are also more prevalent in nature. A much broader area of quantum many-body 
physics can be entered by also allowing anisotropics in spin space, i.e., different jc, y, and 
z coupling strengths in (1). In addition to the relevance of such anisotropies in many real 
magnetic materials, the mapping between 5=1/2 spins and "hard-core" bosons makes 
such models interesting also for other reasons. The spin-isotropic Heisenberg interac- 
tion can be considered the essence of quantum magnetism, however, and we will focus 
almost exclusively on this case here. 

Section Outline. After discussing the antiferromagnetic (Neel) state and the nature 
of its quantum fluctuations based on spin wave theory in Sec. 2. 1, two important classes 
of non-magnetic states — spin liquids and valence-bond solids — will be discussed in 
Sec. 2.2. In Sec. 2.3 the special properties of ID systems are briefly reviewed. Sec. 2.4 
introduces the extended quantum Heisenberg models and the types of quantum phase 
transitions that we will study in much more detail in the later sections, in applications of 
the various computational methods. 



2.1. The Neel state and its quantum fluctuations 

On the 2D square lattice (and other bipartite 2D and 3D lattices with uniform inter- 
actions) the ground state of the Heisenberg model with only nearest-neighbor interac- 
tions is antiferromagnetically (Neel) ordered, with neighboring spins being oriented, on 
average, in an antiparallel (staggered) fashion. Note that a maximally ordered antiferro- 
magnetic state, e.g., | titi; ■ ■ ■) on a chain (or a checkerboard pattern of t and 4- spins 
on the 2D square lattice), is not an eigenstate of the Heisenberg hamiltonian, whereas a 
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fully polarized ferromagnetic state, e.g., | tttt • • •), is an eigenstate (and, in the case of 
ferromagnetic interactions, has the minimum energy). This can be easily seen with the 
pair interaction (1) written as 

Hij = Jij{S^S) + S]S] + Sf^p = Jij[S\S) + i {SfS^ + 5r5t)] . (2) 

When acting on the perfect Neel state, the raising and lowering operators flip pairs 
of spins, causing local defects, whereas they destroy the perfect ferromagnetic state. 
The antiferromagnetic order must therefore always be reduced by quantum fluctuations, 
whereas the fully polarized ferromagnetic state is the ground state also in the presence 
of the off-diagonal interactions. The amount of magnetic order (if any) remaining in the 
true ground state of a system with antiferromagnetic interactions depends sensitively on 
details of the lattice and the interactions included in the hamiltonian. 

Note again that the magnetic order parameter of a system with Heisenberg interactions 
(e.g, the magnetization of a ferromagnet or the sublattice magnetization of an antiferro- 
magnet) is a vector in spin space. The sublattice magnetization operator is 

™^ = ^ E ^i^i^ (3) 

where 0, = ±1 is the staggered phase factor, e.g., on the 2D square lattice 0,- = ( — 1)^'+^', 
where and >', are the lattice (integer) coordinates of site /. In a Neel state the expectation 
value (m^) = m^x + m;^y + m^z is non-zero in the thermodynamic limit. The order can 
form — the spin-rotational symmetry can be broken — in any direction in spin space. For 
convenience one normally associates the z spin components with this direction, so that 
the staggered magnetization (ms) = | (5?) | . In a finite system a non-zero magnitude m.v of 
the Neel order can form, even though the direction of the vector remains fluctuating over 
all angles and, thus, (m^) = 0. In a calculation for a finite system one should therefore 
detect the presence of order using a quantity which is independent of the direction, e.g., 
(m^) or (|m.s|). We will see many examples of finite-lattice calculations later. First, let us 
discuss some of the basic properties of the symmetry -broken Neel state in more detail. 



2.7.7. spin wave theory 

The Neel state and its excitations in 2D and 3D systems can be understood within a 
simple linear spin wave theory (discussed in more detail in many standard works, e.g., 
the review article by Manousakis [6] and the book by Mattis [36]). Such a calculation 
starts from a maximally ordered state of staggered spins, which is the exact ground state 
in the classical limit where the spin magnitude 5 — )■ oo. This is regarded as a vacuum 
state, on top of which quantum effects are included systematically by adding bosons, 
representing the deviations of the spins from l^^l = S, in such a way as to obtain a good 
approximation to the ground state of the system for finite S. 

The relationship between spins and bosons for 5 = 1/2 is illustrated in Fig. 1. The 
reason for using this mapping is that it is easier to carry out calculations with the bosons, 
due to their simpler commutation relations. Linear spin wave theory corresponds to non- 
interacting bosons and is, by construction, exact for S ^ oo. Results for finite S can be 
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spins ^^^^^^^ ^^^^^^^ 
bosons — — — — — — — — 

FIGURE 1. Correspondence between S ~ 1/2 spins and bosons, here illustrated for a ID chain. The 
fully staggered reference state (left) is the vacuum for bosons (with - indicating zero boson occupation). 
For each spin flipped with respect to the reference state there is a boson (•) at the corresponding site 
(right). For S =1/2 the boson occupation numbers are or 1 on each site, while for arbitrary spins they 
are 0, . . . ,25. Linear spin wave theory corresponds to non-interacting bosons, and the occupation number 
constraints are not enforced. Higher-order calculations m\/S gradually restore the constraints. 



systematically improved by including interactions in the form of an 1 /5 expansion. Here 
we just outline the lowest-order calculation. 

If we neglect the constraint that the boson number n,- for each site should be within the 
range 0, . . . , 25 (the physical subspace), we can use the following simple (lowest order 
in 1/5) mapping between the spin operators in the original hamiltonian (2) and boson 
creation and destruction operators af and a,- (and the number operators nt = afat);^ 

i G t sublattice : 5f = 5 - n,-, 5+ = VlSai, 5r = V2Sal 
/ G X sublattice : S~; = nj — 5, 5^ = V^Sa'f , 57 = V^Suj. 

It is useful to look at Fig. 1 to understand this mapping — apart from the obvious way in 
which the off-diagonal operators can affect the states, one just has to be careful with the 
different factors associated with boson creation/destruction and spin raising/lowering 
(discussed in standard quantum-mechanics texts). The factors in (4) are correct for 
fii <^ 5, but note that they are also exact for 5 = 1 /2 in the physical subspace. In principle, 
one can write down more complicated expressions that are formally correct for any 5 and 
rii in the physical subspace (and also automatically decouple the physical and unphysical 
subspaces), but these are only relevant when going beyond linear spin wave theory. 

We now transform the terms in the hamiltonian (2) using the mapping (4). Because 
we are considering a bipartite system, where the two sites i and j are always on different 
sublattices, we obtain the off-diagonal term 

i(5+57 + 575+) ^ 5(a,a; +a+a+), (5) 

and the diagonal interaction is 

S]S) -5^ + 5(n/ + Uj) - fiiUj. (6) 

spin wave theory should formally be regarded as a large-5 calculation. In the lowest- 
order calculation we should therefore discard the interaction term in (6), because 
it is a factor 1/5 smaller than the non-interacting contributions from (5) and (6). Let us 



Often two species of bosons, a,- and bj, are used, corresponding to the two sublattices. In momentum 
space, the use of a single boson species implies that we are here working in the full Brillouin zone of the 
original lattice with momenta, instead of one with N /2 momenta corresponding to a two-site unit cell. 



Computational Studies of Quantum Spin Systems 



January 18,2011 



9 



for definiteness consider the two-dimensional square lattice with N = !?■ sites. We then 
have the effective hamiltonian 

N 

H = -INS^J + ASJ Hi + 57 £ {aittj + afa^) . (7) 
'•=1 (ij) 

Here it should be noted that we consider a finite system but assume that the symmetry 
is broken (the direction of the staggered magnetization has been locked) which can be 
strictly true only in the limit N ^ oo (unless some symmetry-breaking field is added to 
H). This is fine, however, because we will anyway take the limit A/^ — > <» at the end. 

The boson hamiltonian (7) can be easily diagonalized (i.e., written in terms of number 
operators). To construct this solution, we first Fourier transform, 

ak = AT- £6*-^, flr = iV-^/^£e-*-"flk, (8) 

r k 

where the real- space operators have been labeled by their lattice coordinate vectors r 
instead of just the site index i, and the momentum k is in the first Brillouin zone of the 
square lattice (i.e., the reciprocal square lattice of A'^ sites). The hamiltonian is then 

H = -INS^J + 4S7£nk + 257 £ 7k(aka_k + a^a\), (9) 
k {ij) 

where, with the lattice constant set to 1 (i.e., kx,ky — n2%/L, n e {0, . . . ,L — 1}), 

% = l[cos(kx) + cos{ky)]. (10) 

The next step is to carry out a Bogolubov transformation to boson operators mixing the 
original -|-k and — k operators; 

ttk = cosh(0k)ak -I- sinh(0k)«lk' ^) 

which has the inverse 

Ok = cosh(0k)ak - sinh(0k)«ll!^k- (1^) 

It is easy to verify that the operators ock obey the standard bosonic commutation relations 

for any 0^. The trick is to choose these "angles" for each k such that all operators of the 
form oCkOC-k and a^a\ cancel out in the hamiltonian (9). This is the case if 

2cosh(0k)sinh(0k) 



cosh2(0k)-Fsinh2(0k) 



^Jk- (13) 



The Bogolubov-transformed hamiltonian (the spin wave hamiltonian) is then diagonal 
in the occupation numbers; 

H ^Eo + Y,(OkCC^ccy„ (14) 

k 

where, after some algebra making use of Eq. (10), the constant can be written as 

Eo = -257£ - 2NS^J, (15) 
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and the dispersion relation in (14) for the spin wave states a^|0) is given by 



(Ok = ASJ^I-yI (16) 

For momenta close to (0,0) and {tz^k), this dispersion is linear, (Ok, = ck and (Ok = 
c|(;r, ;r) — k|, respectively, with velocity (the spin wave velocity) c = 2\/2SJ. 

The ground state |0) of (14) is the vacuum for spin waves, where the energy is just Eq 
given by (15). The sum can be evaluated numerically, most easily by a straight-forward 
summation over the momenta on large lattices, and extrapolating Eq/N to N ^ oo (or 
converting the sum divided by to an integral, the numerical evaluation of which gives 
the = oo value directly). The resuh is Eq/JN = -0.65795 for 5 = 1 /2. 

Note that while the ground state does not contain any Bogolubov a-bosons (spin 
waves), it does contain some amount of the original o-bosons. The sublattice magne- 
tization is directly related to the density of these bosons, which is uniform and can be 
computed at any site or averaged over the sites; 

1 ^ 

(m,) =S- {0\4ai\0) =S--Y, {0\4ai\0) . (17) 

(=1 

Using the Bogolubov transformation, this becomes 

(m,)-5-i£sinh2(0k). (18) 
^ k 

In the most interesting case of 5 = 1 /2, this evaluates to (m^) = 0.3034, or 61% of the 
"classical" value 1 /2. Thus, the quantum effects (zero-point fluctuations, represented by 
the presence of of bosons) reduce, but do not destroy, the long-range order. 

In principle it is not clear whether spin wave theory should be reliable for small S. 
There has been much discussion of this issue, but the method does in fact give a good 
description of the 2D Heisenberg model on the square lattice. As we will see later, 
unbiased QMC calculations give results for Eq and (nis) differing from those quoted 
above by only 1 — 2%. This can be traced, a posteriori, to the true value of (my) being 
quite large (i.e., spin wave theory can be expected to be accurate when the density of 
bosons is low). In cases where the true sublattice magnetization is small or vanishes, 
spin wave theory normally does not work that well, even when going to higher orders in 
1 /S (which increases the complexity of the calculation very significantly [40, 41, 42]). 



2.1.2. Destruction of the Neel order 

When going beyond bipartite lattices with uniform interactions, or by supplementing 
the Heisenberg model with additional interactions (e.g., including more than two spins) 
the quantum fluctuations can become so significant that the ground state loses its long- 
range Neel order (retaining only short-range antiferromagnetic correlations). There are 
several other possible types of ordered and disordered ground states, some of which have 
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no classical counterparts. Much of the current interest in quantum spin models is related 
to the existence of non-magnetic states and quantum phase transitions between them 
and the Neel state [2]. This is also the main theme of the numerical calculations to be 
discussed in these lecture notes. One long-standing motivation for studying such transi- 
tions stems from the cuprate high-Tc superconductors, the undoped parent compounds 
of which are antiferromagnets corresponding closely to weakly coupled Heisenberg lay- 
ers (many properties of which can be understood based on a single layer) [6]. In these 
systems the magnetic order is destroyed upon doping by mobile charge carriers. This is 
a very challenging electronic many-body problem, where computational studies are also 
playing an important role [43, 44, 45]. While the full solution of the high-Tg problem 
will of course require more complicated models (perhaps some variety of the t-J or Hub- 
bard model), some generic aspects of the physics close to a quantum phase transition 
out of the Neel state can, however, be understood based on spin-only models [46]. 

Apart from the cuprates and related antiferromagnetic systems, there are also many 
materials with non-uniform or frustrated spin interactions [11, 12, 13], which can lead 
to non-magnetic low-temperature states. Many of these states, and the possible quan- 
tum phase transitions between them, are still not well understood. It is therefore useful 
to search for and study prototypical quantum spin models that realize various types of 
ground states and quantum phase transitions. Studies of quantum phase transitions also 
have a broader context of understanding "exotic" manifestations of quantum mechanics 
at the collective many-body level [2]. There are even interesting connections with par- 
ticle physics — close analogies exist between phase transitions in supersymmetric gauge 
theories in 2-t-l dimensions and "deconfined" quantum-critical points that may separate 
the 2D Neel and VBS ground states [18]. 



2.2. Spin liquids and valence-bond solids 

Within linear spin wave theory, a non-magnetic state corresponds to boson density 
(n,) = S. This kind of state is, however, not a good representation of actual non-magnetic 
ground states of Heisenberg and related quantum spin models, because it does not 
contain any correlation effects. To discuss more interesting non-magnetic states, it is 
useful to first look at the quantum fluctuations from a different perspective. 

For two isolated spins /, j (a dimer, N = 2), the ground state of the 5=1/2 antiferro- 
magnetic Heisenberg hamiltonian (1) is the singlet; 

It,-;,-) - IM/) 

Although the two spins in such a singlet are always perfectly anti-correlated (entangled), 

the individual spins are strongly (maximally) fluctuating, and there is no static spin 
order; (S,) = (Sj) = 0. In contrast, the perfect Neel states forN = 2, It/ij) and li/tj)? are 
product states (i.e., of the form |<^,) (g) |(^y)) with no fluctuations (and no entanglement — 
loosely speaking, the degree of entanglement corresponds to the deviations from a 
product state). Note that for = 2 the (eigen) energy of the singlet is — 37,j/4, whereas 
the expectation value of the energy in the Neel states is much higher, —Jij/4 (and the 



Computational Studies of Quantum Spin Systems 



January 18, 2011 



12 




FIGURE 2. Valence-bond states in two dimensions. The thick lines represent singlets. The arrows show 
a spin configuration compatible with the valence bonds (i.e., the spins on each bond are antiparallel — the 
valence-bond state is a superposition of all such spin configurations). A valence-bond state with only short 
bonds (or any superposition of such states), as shown here in the extreme case of all bonds of length one, 
has no magnetic order in two dimensions. There can, however, be order in the valence bonds, (a) and (b) 
represent typical configurations of a disordered RVB spin liquid and a columnar VBS, respectively. 



States are not eigenstates). The tendency of interacting spins to entangle by forming pair- 
wise singlets to minimize the energy remains in multi-spin systems, but when N > 2 a 
spin cannot simultaneously form pure singlet pairs with all its neighbors. The system 
can instead be thought of as a superposition of different singlet pairings. No pair is 
then in a pure singlet, and the energy contribution from each interaction Hij is therefore 
always higher than the singlet energy —37,^/4. This can be regarded as a reduction of 
quantum fluctuations (leading to less than maximal two-spin entanglement) for N > 2, 
bringing the state (or, more correctly, the density matrix) of each interacting pair closer to 
a product state. A state with antiferromagnetic long-range order, breaking the rotational 
invariance of the hamiltonian, can form in the thermodynamic limit if there are enough 
fluctuations in the singlet pairings (and note that, from the perspective of singlets, the 
Neel state has larger fluctuations than, e.g., a state of fixed singlet pairings — to be 
precise, fluctuations should always be specified with respect to some reference state). If 
the system is one-dimensional, or if the interactions are strongly frustrated (competing), 
or if the lattice geometry and couplings favor the formation of singlets in a specific 
pattern (e.g., in a system of weakly coupled dimers), antiferromagnetic order may not 
be present in the ground state when N 

Valence-bond states. The above intuitive picture of a state of fluctuating singlets 
can in fact be made rigorous. Any singlet state can be expanded in basis states that are 
products of singlet pairs, or valence bonds. Denoting by (/, j) a singlet of spins / and j, 
as in Eq. (19), a normalized valence-bond basis state for (even) spins is of the form 

\^)=N-'/\hJi){i2j2)---{iNllJNl2)), (20) 

where each site index appears exactly once (i.e., each spin belongs to one singlet). 
This basis is over-complete in the singlet subspace, and, thus, any singlet state can be 
expanded in these states, but the expansion coefficients are not unique. The valence-bond 
basis and computational methods using it are discussed in Refs. [47, 48, 49, 50]. Here 
we just note that some types of states are more naturally expressed in the valence-bond 
basis than in the standard basis of individual t and J, spins. 

For the valence-bond basis to be (over-)complete, arbitrary lengths of the bonds must 
be allowed. Restricting the lengths renders the basis incomplete (although one can 
actually restrict the bonds to only connect sites on two different sublattices). Some types 
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of states are still completely dominated by short bonds (i.e., the probability of a bond 
of length r decreases rapidly with r). Such short-bond states in two dimensions have 
no magnetic long-range order (while in three dimensions they can be Neel ordered) 
and are often called spin liquids or resonating valence-bond (RVB) states. Various 
types of crystalline order can also form in the bond configurations, leading to periodic 
modulations in observables such as (Sj • S^) (where /, j are nearest-neighbor sites). Such 
ordered states (which break lattice symmetries) are called valence-bond solids (VBSs), 
or valence-bond crystals. Representative configurations of valence bonds in RVB and 
VBS states are illustrated in Fig. 2. 

2.3. One-dimensional systems 

ID systems are rather special and deserve their own introduction. Studies of quantum 
spin chains date back to Bethe's exact solution of the 5 = 1/2 Heisenberg model, which 
was published in 1931 [51] and worked out in greater detail some time thereafter [52]. 
The solution is very complicated (often requiring complex numerical calculations [53, 
54]), however, and many properties of the heisenberg chain were only obtained much 
later and with complementary methods (in particular, renormalization-group treatments 
of effective low-energy field-theories [55, 56, 57, 58, 59]). 

As we have already noted, there can be no magnetic order in a ID Heisenberg sys- 
tem (but VBS order is allowed, since it breaks a discrete symmetry). The spin correlation 
function C(r) = (S/-S,_|_r) of the Heisenberg chain decays with the distance r as {—ly/r 
(with a multiplicative logarithmic correction, which we will discuss later). Thus, the 
ground state is critical (or quasi long-range ordered, on the verge of ordering). Includ- 
ing a frustrated second-nearest-neighbor interaction causes, when sufficiently strong, a 
quantum phase transition into a VBS state (some times also called the spin-Pierls state), 
where the spin correlations decay exponentially with distance [60]. 

While quasi- ID antiferromagnets were actively studied experimentally already in the 
1960s and 70s, these efforts were further stimulated by theoretical developments in the 
1980s. Haldane conjectured [55], based on a field-theory approach, that the Heisenberg 
chain has completely different physical properties for integer spin {S = 1,2,...) and 
"half-odd integer" spin {S — 1/2, 3/2, . . .). It was known from Bethe's solution that the 
5=1/2 chain has a gapless excitation spectrum (related to the power-law decaying 
spin correlations). Haldane suggested the possibility of the 5 = 1 chain instead having a 
ground state with exponentially decaying correlations and a gap to all excitations; a kind 
of spin liquid state [26]. This was counter to the expectation (based on, e.g., spin wave 
theory) that increasing S should increase the tendency to ordering. Haldane's conjecture 
stimulated intense research activities, theoretical as well as experimental, on the 5 = 1 
Heisenberg chain and ID systems more broadly. There is now completely conclusive 
evidence from numerical studies that Haldane was right [61, 62, 63]. Experimentally, 
there are also a number of quasi-one-dimensional 5=1/2 [64] and 5=1 [65] (and 
also larger 5 [66]) compounds which show the predicted differences in the excitation 
spectrum. A rather complete and compelling theory of spin-5 Heisenberg chains has 
emerged (and includes also the VBS transitions for half-odd integer 5), but even to this 
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FIGURE 3. Temperature (T) or coupling (g) dependence of the order parameter (e.g., the magnetization 
of a ferromagnet) at a continuous (a) and a first-order (b) phase transition. A classical, thermal transition 
occurs at some temperature T ~ Tc, whereas a quantum phase transition occurs at some g — gc atT = Q. 



date various aspects of their unusual properties are still being worked out [67]. There are 
also many other variants of spin chains, which are also attracting a lot of theoretical and 
experimental attention (e.g., systems including various anisotropics, external fields [68], 
higher-order interactions [69], couplings to phonons [70, 71], long-range interactions 
[72, 73], etc.). In Sec. 4 we will use exact diagonalization methods to study the 5=1/2 
Heisenberg chain, as well as the extended variant with frustrated interactions (and also 
including long-range interactions). In Sec. 5 we will investigate longer chains using the 
SSE QMC method. We will also study ladder-systems consisting of several coupled 
chains [9], which, for an even number of chains, have properties similar to the Haldane 
state (i.e., exponentially decaying spin correlations and gapped excitations). 



2.4. IVIodels with quantum phase transitions in two dimensions 

The existence of different types of ground states implies that phase transitions can 
occur in a system at T = as some parameter in the hamiltonian is varied (which 
experimentally can be achieved in quantum magnets, e.g., as a function of pressure 
or external magnetic field [74] — it should be noted, however, that the possibilities are 
more limited than in models). As in classical, temperature driven phase transitions, 
such quantum phase transitions can be continuous or first-order, as illustrated in Fig. 3. 
Normally a phase transition is associated with an order parameter, which is zero in 
the disordered phase and non-zero in the ordered phase, e.g., the magnetization m of a 
ferromagnet or the sublattice magnetization of an antiferromagnet. In Sees. 4 and 5 
we will also discuss examples of order-order (Neel-VBS) transitions. Continuous phase 
transitions are associated with scaling and universality, which we will discuss in more 
detail in Sec. 3. In these lecture notes we will not discuss topological phase transitions, 
which are not associated with any local order parameter. The different states across such 
transitions are distinguishable only through some global topological quantity [76]. 

In this section we introduce some spin models exhibiting quantum phase transitions 
in two dimensions. As we have already seen, the 2D 5 = 1/2 Heisenberg model with 
nearest-neighbor interactions has Neel order at T = 0. The question is then how to 
destroy this order and create some different kind of ground state. Specific studies based 
on QMC calculations will be presented in Sec. 5. The discussion here serves as an 
introduction to the kind of models and physical quantities that we will have in mind when 
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FIGURE 4. Dimerized systems with two different coupling strengths between nearest neighbors; a 
bilayer (a) with the dimers across the layers and single layers with columnar (b) and staggered (c) dimers. 



discussing computational techniques. Some QMC results will be shown for illustration 
purposes already in this section, but at this point we do not need to worry about how they 
were obtained — it suffices to say that the data are numerically exact (to within statistical 
errors that are in most cases too small to be discerned in the figures). 

There are also ID analogues of most of the transitions discussed here and some of 
them will be discussed in Sec. 4.3. Systems in three dimensions can exhibit transitions 
similar to those in two dimensions, but most computational research is currently focused 
on ID and 2D systems, partially because 3D systems are much more challenging. The 
quantum fuctations are normally also more prominent in ID and 2D systems, and some 
of the most interesting open questions related to real materials are associated with the 
quasi-2D nature of the systems (although there are also interesting 3D systems [74]). 



2.4.1. Dimerized systems 

The perhaps simplest way to obtain a non-magnetic ground state of a 2D Heisenberg 
model is to dimerize the system [75], i.e., to introduce weak and strong antiferromagnetic 
couplings (bonds), 7i > and J2> J\> 0, respectively, in a pattern such that each spin 
belongs exactly to one strong bond. There are several ways to do this, three of which 
are illustrated in Fig. 4. When J\ = 0, these systems consist of N/2 independent pairs 
of spins (dimers) with intra-dimer coupling J2, and, as we have already discussed, the 
ground state of each such a pair is a singlet; thus the ground state of the whole system 
is a singlet-product (valence-bond) state, which clearly has no magnetic order. On the 
other hand, for J2 = Ji the ground state has Neel order. The question is then how the 
ground state evolves as a function of the coupling ratio g = Ji/Ji - One might perhaps 
think that some amount of Neel order should appear once the dimers are coupled, i.e., 
for any ^ < 00. It turns out, however, that there is actually a phase transition at some 
critical value gc (which depends on the dimer arrangement). While this transition can be 
analyzed using several analytical and numerical approaches, finite-size scaling of QMC 
data is currently the only way to obtain unbiased quantitative results (see, e.g., [77] for 
a discussion of how spin wave theory breaks down close to the phase transition). 

Fig. 5 shows some QMC results for the columnar dimer model of Fig. 4(b). The order 
parameter is the staggered magnetization, with the corresponding operator defined in 
Eq. (3). This is a vector operator, and, for a finite lattice, its expectation value vanishes 
due to the spin-rotational symmetry of the hamiltonian. Its square, (m^), was computed 
in the simulations. Fig. 5(a) shows results for L x L lattices versus the coupling ratio 
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FIGURE 5. QMC results for the squared sublattice magnetization in the two-dimensional Heisenberg 
model with columnar dimerization. (a) shows results versus the coupling ratio g for different lattice sizes 
and (b) shows the size dependence for several values of g. A quantum phase transition where the Neel 
order vanishes occurs at g w 1 .9. 



g, and Fig. 5(b) shows data for several values of g graphed versus 1/L (which is 
often the most convenient way of graphing data when examining the convergence to 
a non-zero value for L — ^ oo). Here it is clear that the behavior changes at ^ 1.9; 
below this coupling the sublattice magnetization extrapolates to a non-zero value in the 
thermodynamic limit, whereas for larger g it decays to zero. 

In the Neel state, the leading finite-size corrections to (m^) are o« 1/L (a result which 
can be obtained using spin wave theory [78, 79]). This can be seen quite clearly in 
Fig. 5(b) for ^ = 1 and 1.5. The extrapolation for ^ = 1 (see Ref. [50], from which the 
results for this coupling are taken) gives (nis) = 0.3074. This is only about 1% higher 
than the linear spin wave result, (m^) = 0.3034. As discussed above in Sec. 2.1.1, the 
success of spin wave theory for this model can be explained a posteriori by the fact that 
the quantum fluctuations are rather weak, reducing the sublattice magnetization only 
by about 40% from the classical value. For larger g, the agreement between spin wave 
theory and QMC results quickly becomes much worse. In this kind of dimerized model 
linear spin wave theory typically over-estimates the critical g [80, 81, 77], while in other 
cases, such as a system with uniform couplings with different strengths in the x and 
y lattice directions (coupled chains) linear spin wave theory predicts a critical point at 
non-zero coupling when, in fact, there is none [82]. Various improvements can be made 
of the spin wave theory (going to higher order in 1/5, incorporating self-consistency, 
etc. [6, 40, 41, 80, 77, 83]), but a system at or close to a quantum phase transition cannot 
be captured correctly by these approximative methods. 

An important question regarding the quantum phase transition in dimerized systems 
is its universality class (a concept to be discussed further in Sec. 3). A quantum many- 
body system in d dimensions can formally be mapped, using path integrals, onto an 
effective classical system in 2-1-1 dimensions (as we will discuss in Sec. 5), and effective 
continuum field-theories can be constructed for the low-energy behavior, including the 
quantum phase transition. Many properties have been predicted in this way based on 
renormalization-group treatments of one such field theory — the nonlinear a-model in 
2-1-1 dimensions [5, 84]. Based on symmetry arguments alone, one would then expect 



Computational Studies of Quantum Spin Systems 



January 18, 2011 



17 




FIGURE 6. Ising (left) and planar (right) spin configurations with minimum energy on a triangle with 
antiferromagnetic nearest-neighbor interactions. 



the transition to be in the universality class of the 3D classical Heisenberg model. There 
are, however, subtle issues in the quantum-classical mapping, and QMC simulations are 
therefore needed to test various predictions. We will see examples of such comparisons 
between results of simulations and field theories in Sec. 5. While results for the transition 
in the bilayer (a) [85] and columnar dimer (b) [86] systems in Fig. 4 (and several 
other cases [87, 88]) are in good agreement with the expectations, recent studies of 
the staggered dimers (c) show unexpected deviations [89] that are still not understood. 



2.4.2. Frustrated systems 

The prototypical example of frustration is a system with antiferromagnetic inter- 
actions on a triangular lattice. Looking at this problem first within the Ising model, 
the spins on a single triangle cannot simultaneously be anti-parallel to both their 
neighbors — there are six configurations with minimum energy, and these all have one 
"frustrated" bond (two parallel neighbors), as shown in Fig. 6. Being a consequence of 
the lattice, this is often referred to as geometric frustration. Upon increasing the system 
size, the ground-state degeneracy grows with the system size, and in the ensemble in- 
cluding all these configurations there is no order of any kind [90, 91]. In the case of the 
classical XY (planar vector) or Heisenberg (vectors in three dimensions) model, there is, 
however, order at T = (but not at T > 0, according to the Mermin- Wagner theorem). 
The energy is minimized by arranging the spins in a plane at 120° angle with respect 
to their neighbors on the same triangle, as shown for a single triangle in Fig. 6. This is 
referred to as a three- sublattice Neel state. There have been many studies of the 5=1/2 
variant of this model. This was, in fact, the system for which the RVB spin-liquid state 
was initially proposed [92]. There is now, however, compelling numerical evidence for 
the three- sublattice Neel order actually surviving the quantum fluctuations [93, 94]. 

On the square lattice, frustration effects can be investigated by adding interactions 
beyond nearest neighbors. A well studied case is the J1-J2 model, where J\ and J2 refer, 
respectively, to the strengths of the nearest- and next-nearest-neighbor interactions. The 
system is frustrated if both 7i > 0,72 > or if 7i < 0,72 > 0. Even with Ising spins, 
this is a highly non-trivial system, with unresolved questions still attracting interest 
[95, 96, 97]. We will discuss the frustrated Ising system further in Sec. 3.4. 

In the case of quantum spins, the S = 1/2 J1-J2 Heisenberg model with all antiferro- 
magnetic couplings is one of the prototypical models with a quantum phase transition 
out of the standard Neel state, in this case as a function of the coupling ratio g = Ji/Ji- 
While many different calculations show rather consistently that the Neel order vanishes 
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FIGURE 7. The square-lattice 5=1/2 Heisenberg model with only nearest-neighbor interactions 
7i > has an antiferromagnetic ground state, in (a) illustrated by open and solid circles for (S^) > 
and (S^) < 0. In (b), next-nearest-neighbor interactions 72 > are shown as dashed lines. When 0.4 < 
Ji/Ji < —0.6 (approximately) the ground state may be a columnar VBS, with (5?) = on all sites but 
modulations in the bond correlations (S, • Sj) (where /, j are nearest-neighbors) as shown with thicker lines 
for the more strongly correlated bonds forming a columnar pattern. For J2/J1 > 0.6, the ground state has 
collinear (striped) magnetic order, as shown in (c). 



at a critical coupling ratio gc ^ 0.4 [98, 99, 100, 101, 102, 103, 104], the order of the 
phase transition and the nature of the non-magnetic ground state are still controversial 
issues. Most studies indicate some type of VBS state, with a columnar one being the 
prime candidate, but an RVB spin liquid state has also been proposed [105]. 

For larger g, above ^ ~ 0.6, there is again magnetic order. This can be understood in 
the limit g ^00, where the system decouples into two separate square-lattice Heisenberg 
antiferromagnets. At ^ = 0° (7i = 0) the relative direction in spin space of the antiferro- 
magnetic order within these subsystems is arbitrary, but for any 7i > the subsystems 
lock to each other and form collinear spin order, with vertical or horizontal stripes of 
parallel spins (a state breaking the 90° lattice rotation symmetry, in addition to the global 
spin rotation symmetry). The transition between the non-magnetic and collinear states 
is most likely first-order. 

The three different ground states of the Ji-Jz Heisenberg model with 5=1/2 spins are 
illustrated in Fig. 7. Note that a classical version of this model (Ising, XY, or Heisenberg) 
has a direct (first-order) Neel-coUinear T = transition exactly at ^ = 1/2 (which can be 
easily verified just by computing the energies of those states for g < \/2 and ^ > 1/2). 
The magnetically disordered state is thus induced by quantum fluctuations, and it has no 
direct classical analogue. It is not clear whether this kind of state persists for 5 = 1 or 
higher spins — for some large S, perhaps already S = 1, it must give way to a first-order 
Neel-CoUinear transition, as in the classical system. 

The reason why it has been so difficult to reach a firm conclusion on the nature of the 
non-magnetic state and the quantum phase transition between it and the Neel state is that 
large-scale unbiased computational studies of the S = 1/2 J1-J2 model are currently not 
possible, because of "sign problems" affecting QMC calculations of frustrated systems. 
We will discuss this issue further in Sec. 5. There are no other unbiased method that can 
reach sufficiently large lattices, e.g., exact diagonalization can reach only A'' ~ 40 spins.^ 



Recently developed variational methods based on tensor-product states in principle become unbiased 
in the Umit of large tensors [21]. They have been applied to various frustrated spin models [106]. The 
computational complexity of such calculations is, however, currently too demanding to reach tensors 
sufficiently large to produce unbiased results in practice. 
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2.4.3. The J-Q class of models 



VBS states of quantum spin systems in two dimensions were predicted theoretically 
more than two decades ago [107]. The VBS formation is associated with a spontaneously 
broken translational lattice symmetry. The VBS state and quantum phase transitions 
into it are therefore quite different from the non-magnetic states and transitions in the 
"manually" dimerized systems discussed above in Sec. 2.4.1. Although there is a pattern 
of "strong" and "weak" spin correlations in both cases, the quantum fluctuations in 
systems with manually and spontaneously broken translational symmetry are different 
(being much more interesting in VBS states). Until recently, large-scale computational 
studies of VBS states and Neel-VBS quantum phase transitions had not been carried out 
starting from microscopic hamiltonians, because of the QMC sign problems affecting 
frustrated Heisenberg models (which at first sight are the most natural systems in which 
to explore the physics of non-magnetic states). 

VBS states and the Neel-VBS transition have come into renewed focus with a pro- 
posal by Senthil et al. [16] that this transition is generically continuous and, thus, violates 
the "Landau rule", according to which an order-order transition (between states breaking 
unrelated symmetries) should be first-order (except at fine-tuned multi-critical points). 
In the "deconfined" quantum-criticality scenario, the VBS and Neel order parameters 
are manifestations of spinon confinement and condensation, respectively. Spinons can 
be thought of as 5 = 1/2 degrees of freedom, but not just corresponding to the bare 
individual spins on the lattice sites, but more complex collective objects "dressed" by 
interactions. A non-compact CP^ field theory was proposed to describe such spinons 
coupled to an emergent gauge field (which corresponds to the properties of the valence- 
bond background in which the spinons exist) [16]. A central question is then whether 
this low-energy physics of a continuum field theory really can arise starting from a rea- 
sonable microscopic hamiltonian. Answering this question requires large-scale compu- 
tational studies of models exhibiting Neel-VBS transitions. 

Since the QMC sign problem prohibits large-scale studies of the J1-J2 Heisenberg 
model and other similar frustrated systems, we have to try something else. In the "J- 
Q" class of models [17, 108, 109], the Neel order is destroyed by an interaction (Q) 
which is not frustrated, in the standard sense, but still competes with the Heisenberg (J) 
interaction. To understand these J-Q models, note first that the Heisenberg interaction is, 
up to a constant, equal to a singlet projector operator: Hij = —Sjj + 1, where 

Sij = l-Si-Sj. (21) 

The pair-singlet, Eq. (19), is an eigenstate of this operator with eigenvalue 1, whereas a 
triplet state is destroyed by it; 

Sij\(l>^j} = \(^!j). Si0.f)=O, (m = 0,±l). (22) 

Thus, when Sij acts on a singlet-triplet superposition, only the singlet component sur- 
vives (is "projected out" — note that the property Sfj — Sij required of a projection op- 
erator is satisfied). The standard Heisenberg interaction thus favors the formation of 
singlets on pairs of nearest-neighbor sites, but, as we discussed in Sec. 2.2, the fluctua- 
tions of these singlets among many different pairings of the spins leads to Neel order in 
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FIGURE 8. Graphical representation of possible arrangements of products of singlet-projector oper- 
ators Sij in the J-Q model and its generalizations, (a) is the Heisenberg exchange, (b) the four-spin 
interaction of the original J-Q model, and (c) a six-spin interaction which leads to more robust VBS order. 
These operators, and their 90°-rotated analogues, are summed over all positions on the square lattice. 



the ground state. The idea behind the J-Q models is to project singlets on two or more 
bonds in a correlated fashion, using products of several Sij operators on a suitable set of 
different bonds. This favors a higher density of short valence bonds, thereby reducing or 
completely destroying the antiferromagnetic order. 

The original J-Q hamiltonian [17] on the square lattice can be written as 

H = -JY,S,j-Q S^Ski, (23) 

m m) 

where both the J and Q terms are illustrated in Fig. 8. The Q interaction involves four 
spins on a 2 X 2 plaquette. An interaction with three singlet projectors in a columnar 
arrangement is also shown, and operators with even more projectors, or with the projec- 
tors arranged on the lattice in different (non-columnar) patterns, can also be considered 
[109]. With 7 > and Q>0 [and the minus signs in front of the interactions in Eq. (23)], 
correlated singlets are favored on the lattice units formed by the product of singlet pro- 
jectors. It is still not clear just from the hamiltonian whether a VBS state is realized for 
large Q/J — since the hamiltonian does not break any symmetries (the interactions in 
Fig. 8 are summed over all distinct lattice translations and rotations), a VBS state only 
forms if the hamiltonian also contains in it, implicitly, some effective interactions that 
favor singlets in some ordered pattern. The J-Q model (23) does exhibit a Neel-VBS 
transition, at Qc/J ~ 22 [17, 108, 1 10], and at lower Qc if more than two singlet projec- 
tors are used [108, 109]. We will discuss this in more detail in Sec. 5. Here we just look 
briefly at examples of correlation functions useful for characterizing the Neel and VBS 
states and the quantum phase transition between them. 

Order parameters. As an alternative to the square of the full spatially averaged 
sublattice magnetization (3), we can detect the presence or absence of of Neel order 
using the spin correlation function, 

C(r,,) = (SrS,), (24) 

at long-distances. Fig. 9(a) shows J-Q model results for the largest separation of the 
spins, Tmax = (^/2,^/2), on periodic L x L lattices. The results have to be analyzed 
carefully to determine the transition point, but already this raw data suggest that the 
Neel order vanishes, i.e., C(rinax 

) when L ^ oo, for 7/2 < 0.04. 
VBS order can be detected in the dimer (or bond) correlation function, defined as 

D,,{rij) = (5,(r,)5x(r,-)), (25) 
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FIGURE 9. QMC results for the long-distance spin (a) and staggered dimer (b) correlation functions 
(corresponding the Neel and VBS order, respectively) in the ground state of the J-Q model (23) versus the 
coupling ratio J/Q. A careful analysis of these correlation functions, as well as other quantities, indicates 
a single critical point {J/Q)c ~ 0.045 where both the Neel order and VBS order vanish continuously. 



where the bond operator is given by 

5;,(r,)=S(r,)-S(r, + x). (26) 

Here, instead of using a subscript i on the spin operator for some arbitrary site labeling, 
it is more convenient to use the corresponding lattice position vector r,. Then r,- -fx 
corresponds to the site immediately next to r,- in the positive x direction. The subscripts 
XX in (25) indicate that the two bond operators are both oriented in the x direction, and 
this correlation function is, by symmetry, equal to Dyy on an L x L lattice. One can also 
consider bond correlations D^y, where the two bond operators are oriented differently. In 
a VBS state such as the one illustrated in Fig. 7, one expects Z)xx(r) to exhibit a columnar 
pattern of smaller and larger values. A VBS order parameter can then be defined as a 
suitable difference between these modulated correlations, e.g., 

D;,(r) = D,,(r) - i [D,,(r-x) +D,,(r + x)]. (27) 

This correlation function is shown in Fig. 9(b) at the largest lattice separation for several 
different system sizes. Here it is clear that VBS order exists for 7 = 0, up to 7/2 ^ 0.04, 
roughly where the Neel order sets in. As we will discuss in Sec. 5, all calculations 
so far point to a single critical point, without any intervening third phase or region of 
coexistence of both Neel and VBS order. 

3. CLASSICAL PHASE TRANSITIONS, MONTE CARLO 
SIMULATIONS, AND FINITE-SIZE SCALING 

Many aspects of quantum phase transitions, and how to analyze them based on numerical 
finite-lattice data, are very similar to classical phase transitions. Here we discuss this 
common formalism in the simpler context of classical phase transitions, before turning 
to calculations and data analysis for quantum systems. 
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In classical statistical mechanics the prototypical example of a system with a continu- 
ous phase transition is the 2D Ising ferromagnet, the exact solution [111, 112] of which 
shows rigorously that such a phase transition exists. This model is defined by 

N 

£c7 = -^£a,<7;-/2£<T,-, (28) 

(iJ) '=1 

which is just the (potential) energy as a function of the spins (7/ = ±1. We use o to 
collectively denote an entire spin configuration; a = (ai, . . . , 0)v). With the interaction 
bonds (ij) restricted to nearest-neighbors on the simple 2D square lattice, the exact 
critical temperature for an infinite system is Tc/J = 2/ln(l 4- \/2) ^ 2.269. The order 
parameter is the magnetization (m) = (a,). It has the asymptotic T ^ Tc (T < Tc) 
form moc \t\^, where t is the reduced temperature, t = {T — Tc)/Tc, and the exponent 
/3 = 1/8. Such critical exponents and other aspects of scaling behavior at continuous 
phase transitions will be discussed in this section. 

The exact solution of the Ising model is very special, and normally one studies phase 
transitions in other ways. Critical exponents appear already in simple mean-field theo- 
ries, but their values are typically not correct. Mean-field theory is nevertheless essential 
as a starting point, which we here outline for the Ising model. The most important theo- 
retical framework for phase transitions is the renormalization group, which explains how 
universal (depending only on symmetries and dimensionality, not details of the interac- 
tions) non-trivial exponents (i.e., different from the generic mean-field values) can arise 
(see, e.g., the book by Cardy [15]). To compute critical exponents and other properties in 
an unbiased way, one normally uses Monte Carlo simulations, where spin configurations 
are stochastically sampled according to the Boltzmann distribution.^ In this section the 
basics of Monte Carlo simulations are outlined (for more details, see, e.g., the books by 
Landau and Binder [115] and Newman and Barkema [116]). To motivate and illustrate 
finite-lattice calculations in general, we will also discuss how properties in the thermo- 
dynamic limit can be extracted from Monte Carlo data. 

Section outline. The standard mean-field treatment of the Ising model is discussed 
in Sec. 3.1. In Sec. 3.2 the Monte Carlo method is introduced and used to illustrate 
how phase transitions and symmetry-breaking can occur in practice for large systems 
(formally in the limit of infinite system size). In Sec. 3.3 we review the key aspects 
of critical behavior (defining exponents, etc.) in the thermodynamic limit, then discuss 
the finite-size scaling hypothesis and demonstrate its usage by analyzing Monte Carlo 
results for the 2D Ising model. First-order transitions are discussed in Sec. 3.4, using 
a frustrated 2D Ising model as an example of finite-size scaling methods for detecting 
discontinuities. In Sec. 3.5 the important concept of spin stiffness (which corresponds 
to an elastic modulus of a solid) is introduced in the context of XY (planar vector) 
spin models. The scaling properties of the spin stiffness are illustrated with Monte Carlo 



^ Other important methods include high-temperature series expansions [113] and e-expansions (where 
e = du—d) around systems at their upper critical dimension du [114], where the exponents take their 
mean-field values. Monte Carlo simulation is normally the most reliable method, however. 
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results for the 3D and 2D XY models, the latter of which does not exhibit a normal phase 
transition into an ordered state, but exhibits a different Kosterlitz-Thouless transition 
into a critical low-temperature phase. In Sec. 3.6 we briefly discuss how the classical 
criticality concepts are generalized to quantum phase transitions. 



3.1. Mean-field theory of the Ising model 

In mean-field theories the environment of a subsystem of an infinite lattice is replaced 
by an external field representing the average interactions between the subsystem and the 
environment. The subsystem can be a single spin or a cluster of several spins. Here we 
will just consider the simplest case of a single-spin calculation for the Ising model, i.e., 
the infinite environment of a single spin is replaced by an effective field. 

It is convenient to write the hamiltonian (28) in an extended form with general 
interactions Jij between all the spins (on an arbitrary lattice), and also include an external 
magnetic field; 

N N N 

^cT = - ;j £ £ JijOiGj - /i £ CJ,-. (29) 
i=l j=l i=l 

Note the factor 1/2, which compensates for each interacting pair being included twice 
in the sum (and Ju — 0). We do not impose any restrictions of the signs and magnitudes 
of Jij, but for simplicity we assume that if there is a phase transition, the ordered state is 
ferromagnetic. To construct and motivate the single-spin approximation, we first group 
together all the interactions in (29) involving an arbitrary spin /; 

Ei^-Oi^Y^JijOj + h^. (30) 

Note that there is no factor j in front of the sum here, and we have used 7,y = 7y/. 

We now assume that h > 0, so that m = (a,) > 0. We will investigate spontaneous 
ordering in the absence of the field, by eventually letting h^O. Adding and subtracting 
a constant, we can write the terms within the parentheses in (30) as 

Y,JijOj + h = mY,Jij + h + Y,Jij{Oj-m). (31) 

j j j 

In its most basic formulation, mean-field theory amounts to neglecting the second sum 
in (31) — the fluctuation term — after which we are left with the easily solvable problem 
of a single spin in an effective magnetic field of strength y^m -|- h; 

Ea = -{Jsm + h)G, (32) 

where Jg is the sum of the original couplings 

Js = Y.Jw (33) 

j 
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and we assume a translationally invariant system, so that this sum is independent of i. 
The magnetization m in (32) is at this stage unknown and will be detemiined through a 
self -consistency condition; (o) =m. 

Mean-field theory in the present formulation can be justified if the neglected fluctua- 
tions are much smaller, on average, than the other terms in (31), i.e., if 

4 = M^^«l. (34) 

Here () denotes the expectation value under the actual probability distribution (the 
Boltzmann distribution) of the spins, which we cannot compute exactly. In principle the 

fluctuations can also be calculated within mean-field theory, as an internal consistency 
check. Even without doing any calculations, we can roughly deduce the conditions 
under which 5^ will be small. Clearly, 5^ is small if there is substantial order, i.e., 
when (1 — m) = (1 — (<7/)) <S 1, because then most oj = I m. This is the case if 
h is large. It is also true for = if the symmetry is spontaneously broken and m is 
close to 1, i.e., for T <^Tc if there is a phase transition. Moreover, 5^ can be small 
even if m is not very large, if the sum over j involves many non-zero (and relatively 
large) coupling constants Jij — because of cancellations of fluctuations, the sum T^jJijOj 
will then typically (in most of the statistically important spin configurations) be close 
to mJ^jJij. In the extreme case of infinite-range uniform interactions, Jij — J/N for 
all ij, and N ^ oo (where we regard / as a finite constant, e.g., 7 = 1, in order to 
have a finite energy) all the fluctuations cancel out exactly and ^ = (and the mean- 
field theory is then exact). This holds true exactly also for short-range interactions on 
an infinite-dimensional lattice. Thus, in general, even if m is not large, the fluctuation 
measure 5^ can be expected to be small for systems in high dimensions and/or for long- 
range interactions. These are then the conditions under which mean-field theory can 
be expected to be quantitatively accurate. Even in cases where it is not quantitatively 
accurate, mean-field theory can still provide valuable insights qualitatively. 

Let us now actually solve the mean-field problem (32), i.e., the single-spin problem 
(32) under the self-consistency condition (o) =m. The magnetization is 

= £^,aiJ..^H)/T = ^U"^ + (35) 
and thus the self -consistency condition reads 

m = tanh[(/,m + h) /T] . (36) 

This equation in general has to be solved numerically, which can be done easily using 
successive bracketing of the solution. For small m and h, we can proceed analytically 
by expanding to leading order in m, h. First, when the external field = 0, m = is a 
solution for all T . Looking for other possible solutions, expanding (36) to third order in 
X = Jsm + h, tanh(j:) = x — x^/3, we have 
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FIGURE 10. Mean-field solution of the Ising model, (a) Magnetization versus external field for temper- 
atures above, at, and below Tc- The discontinuity for h — > 0+ and h — > 0^ corresponds to the spontaneous 
magnetization at h = 0. (b) Temperature dependence of the spontaneous magnetization. 



from which we can identify the critical temperature Tc = Js below which the magnetiza- 
tion can be non-zero. The asymptotic T — t- Tc behavior is 

m=(3^^J , (/i = 0, r<r,). (38) 

We can also obtain the /? — )■ field dependence of the magnetization at Tc from (36). 
Keeping the leading-order terms in h of the third-order expansion at T = Js we get 

m=(3-j , {T = Tc). (39) 

It is instructive to also look at the full numerical solution for m. Fig. 10(a) shows some 
examples of the field dependence of m at different temperatures. For T >Tc the behavior 
is analytic across h = 0, with a singularity described by (39) developing as T — )• Tc. 
Below Tc the behavior is discontinuous, which corresponds to a first-order transition 
versus h between the m < and m > states. The discontinuity corresponds to a 
spontaneous magnetization at zero field, the full numerical solution of which is graphed 
in Fig. 10(b) and the asymptotic T — )• T^7 behavior of which is given by (38). Here the 
role of h — )• 0+ or h — )• 0^ in the calculation is to break the degeneracy between the ±m 
solutions. The degeneracy for h = also corresponds to co-existence of the two ordered 
states exactly at the first-order transition, as we will discuss more generally in Sec. 3.4. 

Translating the result Tc = Js with Js given in Eq. (33) gives Tc = 47 for the 2D 
Ising model with nearest-neighbor interactions of strength J. This is well above the 
correct value, Tc/J ^ 2.269. An over-estimation of Tc can be expected on account of the 
neglected fluctuations (which naturally lower Tc). 

The exponents /3 = 1/2 and 5 = 1/3 in (38) and (39) are generically the values of 
these critical exponents within mean-field theories. Other exponents can also be com- 
puted [15]. While power laws are indeed correct generic features of continuous phase 
transitions, the mean-field values are not correct in general. As already mentioned, for 
the 2D Ising model, /3 = 1/8 from Onsager's exact solution. In three dimensions its value 
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in the Ising universality class is /3 ~ 0.33, as determined using Monte Carlo simulations 
and series expansion methods (and also, less precisely, using field-theoretical methods 
and other analytical approaches). The mean-field critical exponents are exact in four and 
higher dimensions (four being the upper critical dimension for the Ising model — the 
dimensionality above which the mean-field critical exponents become exact). We will 
discuss critical behavior in greater detail further below, after developing the technical 
aspects of Monte Carlo simulations to study criticality in practice. 



3.2. Monte Carlo simulations of the Ising model 

In a Monte Carlo simulation, the goal is to generate a sequence of spin configurations, 
(7(1), (T (2), ... , o{K), representing a statistically unbiased sample from the Boltzmann 
distribution, i.e., the probability P{<j) of an arbitrary configuration o to be among the 
sampled ones should be proportional to the Boltzmann weight at temperature T; 

Wa = Q-^'^l'^, (40) 

where we work in units such that ks—^- The actual (properly normalized) Boltzmann 
probability is Pa = Wa/Z, where Z is the partition function 

Z = J^e-^<^/^. (41) 
c 

In Monte Carlo simulations we do not need the full partition function, only the un- 
normalized weights (40). It is also important to note that the sequence a(l), . . . , o{K) 
can be correlated (as we will discuss further below), but we only need the probability 
P{o)'x. Wa and for now do not need to worry abot joint probabilities such as P((7i , 02). 

The Metropolis algorithm. The simplest way to generate a valid sequence of con- 
figurations is with the Metropolis algorithm [117]. This kind of simulation starts from 
an arbitrary spin configuration C7(l) (e.g., randomly generated). Thereafter, each suc- 
cessive o{k+\)is obtained stochastically from its predecessor o{k) according to a few 
simple steps based on flipping randomly selected spins with a probability related to the 
desired distribution. We only need to store the current configuration and from now on 
suppress the "time" index k. We denote by the configuration obtained when the iih. 
spin of o has been flipped; a^' = (ai, . . . , —Oi, . . . , Ojsi). Normally one defines a Monte 
Carlo sweep as A'^ such flip attempts, so that oc N spins are flipped, on average, during 
this size-normalized unit of simulation time. A Monte Carlo sweep can be carried out 
according to the following simple algorithm:'* 



This pesudocode segment resembles a computer language such as Fortran, but with mathematical nota- 
tion making it easier to read. Such pseudocodes will be used throughout these notes to illustrate imple- 
mentations of algorithms (showing the essential steps, but without trivial details that may in practice make 
real working code somewhat longer). Numbers in curly brackets, here {1}, will be used to label codes. 
In most cases the syntax will be rather self-explanatory and does not require any detailed discussions of 
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doj=l,N 

/ = random [1,...,A^] {1} 
if (random[0 - 1]) < W„-i/Wa) Oi = -Oi 
enddo 

This Metropolis algorithm is based on the detailed balance principle, which is a general 
theorem for a stochastic process (a Markov chain in some arbitrary configuration space) 
that should generate a probability distribution W. By this we mean, roughly speaking, 
that the set of sampled configurations should approach the distribution W as the number 
of configurations is increased, independently of the initial condition. Denoting by P{A — )■ 
B) the transition probability of "moving" to configuration B if the current one is A, 
the detailed balance principle states that the desired distribution is generated if all the 
transition probabilities satisfy the condition 

P{A^B) ^ 

P{B^A) W{Ay ^ ^ 

for all pairs of configurations A, 5 for which P{A B) > 0. In addition, the sampling 
should be ergodic, i.e., any configuration C with non-zero weight W{C) must be reach- 
able, in principle, with non-zero probability through a series of moves starting from an 
arbitrary configuration. 

In the Metropolis algorithm, as implemented in code {1}, the transition probability 
consists of two factors; 

P((7 ^ a"') = Aelect(0^accept(c7"') , (43) 

where Pseiect(0 is the probability of randomly selecting spin which here always equals 
l/N. The full transition probability in (42) can therefore be replaced by the probability 
^accept of actually carrying out (accepting) the spin flip. This probability is not unique. 
In the Metropolis algorithm it is taken as 



^accept(<y ') =min 



W i 



(44) 



One can easily confirm that this satisfies the detailed balance condition (42). A proba- 
bility formally has to be < 1, and this is taken care of above with the "minimum of" 
function. In code {1}, comparing the weight ratio with a random number in the range 
[0, 1) automatically achieves the same result. Note that the ratio of the Boltzmann fac- 
tors depends only on the spins interacting with the flip-candidate (7, (in the simplest case 
just its nearest neighbors) and can be rapidly evaluated. In more physical terms, a spin 
flip leading to a lower energy is always accepted, whereas a configuration with higher 
energy is accepted only with probability /accept (o'') = exp[(^cT — ^^o- 0/^]- 



definitions. In the segment above, and in many other ones to follow, random denotes a random-number 
generator, with the square brackets [1, . . . ,iV] and [0 — 1] indicating the range of uniformly distributed 
integers and floating-point numbers, respectively. 
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FIGURE 11. Time series for the magnetization generated in Monte Carlo simulations of the Ising model 
on L X L lattices with L = 8 (top) and L = 16 (bottom) at temperature T/J = 2.2 (< Tc « 2.269). In both 
cases the starting configuration was fully polarized, m = 1, and the subsequent points are separated by 
N = 1? Metropolis spin flip attempts (constituting one Monte Carlo sweep). 

The Metropolis algorithm leads to the correct distribution after some transient time, 

which depends on the initial configuration, the temperature, and the system size. In a 
simulation one should therefore discard some number of configurations before "measur- 
ing" physical observables. After this equilibration, measurements are normally carried 
out after every or every few sweeps. 

In the case of the ferromagnetic Ising model, the most important quantity to calculate 
is the magnetization. It should be computed using all the spins to take advantage of 
self-averaging to improve the statistics. Thus, we will normally use 



We will here consider only simulations with the external field h = 0. We have already 
discussed that fact that relevant symmetries of the hamiltonian (here the discrete spin- 
inversion symmetry) are not broken in simulations of finite systems. For the Ising model, 
one should therefore compute spin-inversion invariant expectation values such as (m^) 
or (|m|) in order to detect the phase transition into the ferromagnetic state. 

Symmetry-breaking and finite systems. To obtain a qualitative understanding of how 
symmetry-breaking in the thermodynamic limit is manifested in practice for large lat- 
tices, it is useful to first look at an actual Monte Carlo time series for m. Examples for 
two small systems at a temperature below Tc are shown in Fig. 11. Here one can see that 
the magnetization fluctuates between positive and negative values, and that the typical 
time taken to reverse the sign of m is longer for the larger system. Plotting the series over 
a longer time makes this clearer, but note that the time the system spends close to m = 
is visibly much smaller for L— 16 than L = 8, and this is of course directly related to a 
typically longer reversal time. It is clear from results such as these that the distribution 
of m values is peaked at non-zero positive and negative values for T < Tc. For T >Tc the 



m = — 




(45) 
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FIGURE 12. Magnetization distributions for LxL Ising models with L = 16 (left) and 64 (right) at two 
temperatures below (T /J — 2.2) and above (T /J = 2.6) the critical temperature Tc/J ~ 2.27. 



distribution is instead a single peak centered at m = 0. Examples of such distributions 
are shown in Fig. 12. In thermodynamics language, these two qualitatively different dis- 
tributions can be understood as a consequence of the free energy F{m) =E{m) — TS{m) 
at low T being dominated by the internal energy E (which is low in large- |m| configura- 
tions), and at high T by the entropy S (which is large for small |m|). 

A double-peaked magnetization distribution at low temperature suggests that even a 
finite system can be considered as ordered, although the spin-inversion symmetry is not 
broken in the simulations. It appears very plausible that the typical reversal time should 
diverge with L (and it is not difficult to check this with simulation data for several L 
values and a suitably definition of a reversal), and then no reversals would take place for 
large L, even during very long simulations. The reason for this divergent time scale is 
that, in order for the magnetization to reverse, a series of local spin flips must necessarily 
take the system through many configurations with m ^ 0, which have increasingly 
high energy for increasing system size (and, thus, lower Boltzmann probability). For 
a large system the distribution for T <Tc h therefore only sampled among the subset 
of configurations with fixed sign of m — the stochastic process in practice becomes non- 
ergodic. Broken symmetry and non-ergodic sampling are manifested strictly only for 
N = oo, but in practice also for large but finite systems on time scales less than the 
typical magnetization reversal time. This time scale of course depends on the details of 
how the spin configurations are thermally sampled — in Monte Carlo simulations and in 
a real magnets — but diverges as N oo for any local sampling scheme. 

Normally, in Monte Carlo simulations one does not investigate the time series and 
the full distribution of physical quantities (although some times this is useful). By 
computing (m^) or (|m|), we do not have to worry about the time scale of reversals. 
We normally want to extrapolate finite-A'^ results to the thermodynamic limit, where 
|(m) I = (|m|) = (m^) (but note that (m^) ^ for finite A^, because of the finite 
width of the peaks in the m-distribution). 

Autocorrelations and statistical errors. Before calculating expectation values, we 
have to discuss how to analyze Monte Carlo data statistically. Consecutive configura- 
tions generated with the Metropolis algorithm are not statistically independent — only 
configurations separated by a number of sweeps much larger than the autocorrelation 
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time are statistically independent. The autocorrelation function for a quantity Q is de- 
fined as 

where t and / denote simulation time, normally in units of the Monte Carlo sweeps 
defined above, and averages are over the reference time /. The normalization is such that 
Aq{0) = 1 and Ag(? — 7> oo) = 0. The asymptotic decay is exponential, Ag(?) ~ e^^/^2^ 
which can be used to the define the autocorrelation time Tq. Normally one instead uses 
the integrated autocorrelation time, which also contains contributions from the often 
dominant non-asymptotic behavior; 

-J oo 

^2'= 9 + 1^2(0- (47) 

^ f=i 

Here we will not discuss autocorrelations at length, but only summarize their underlying 
role in determining the statistical precision ("error bars") of computed quantities. 

The autocorrelation time for m of the Ising model roughly corresponds to the typical 
time between magnetization reversals (as in Fig. 11). Other quantities, such as and 
|m|, that are not sensitive to the sign of m, have shorter autocorrelation times (however, 
often much longer than a single Monte Carlo sweep). A long autocorrelation time does 
not bias a computed average (i.e., it is not wrong to measure Q after every Monte Carlo 
sweep even if Tg S> 1), provided that the total simulation time is much longer than Tg. 
The autocorrelation time does, however, come into play (explicitly or implicitly) when 
computing statistical errors. 

To calculate the statistical errors, one can subdivide a simulation into a number B 
of bins, each containing some number M of Monte Carlo sweeps. For some quantity 
Q, averages Qh, b = I, ... ,B are computed over each bin, and the final average Q and 
error bar Oq (one standard deviation of the average of the bin averages) are calculated 
according to 

e=il&, <'a = 5(^£(&-e)^. (48) 

The final estimate of the true expectation value (Q) should then be quoted as Q±Oq. 

The reason for binning the data is that, according to the central limit theorem, the 
distribution of bin averages is Gaussian for large M (unlike the distribution of individual 
measurements, as seen clearly for the magnetization in Fig. 12), and the computed error 
bar then has a well defined unique meaning (e.g., we know that the probability of (Q) 
being within one error bar of Q is about 2/3). This is only true if the bin length M is also 
much longer than the autocorrelation time, so that the bin averages can be regarded as 
statistically independent. If th at is the case, the error bar should depend only on the total 
number of sweeps; (Tg oc 1/ ^/MB, where the factor of proportionality is °« ^/Tq (and of 
course also depends on the detailed form of the distribution of the individual measured 
Q values). It is not necessary to calculate Tq explicitly. If there is any doubt about the 
bins being sufficiently long, one can check this by using a rather large number of bins 
(e.g., in the range 100-1000) and saving all the bin averages on disk. The data can then 
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be re-binned into longer bins post-simulation, and the convergence of Oq as a function 
of the bin length can be tested. Saving the bin averages on disk is always advisable, not 
only for the purpose of analyzing the error bars, but also in order to make it easy to add 
more data at some later time (to improve the results, if needed). 

An important point that has to be mentioned is that the autocorrelation time of any 
local updating scheme, e.g., the Metropolis algorithm, diverges for T ^ Tc and N ^oo. 
We previously saw an example of this in the magnetization reversal time of the Ising 
model, but divergent autocorrelation times also affect any quantity (in particular and 
\m\) that is sensitive to the fluctuations of the magnitude of m (which in practice is the 
case for most quantities of interest). It is therefore difficult to obtain good results for large 
systems close to a critical point. In many cases, including the Ising model, this problem 
can be solved (or almost solved) by using cluster algorithms [118, 1 19], where clusters 
of spins (constructed so as to satisfy detailed balance) are flipped collectively (instead of 
flipping individual spins one-by-one). While we will consider an analogous loop-cluster 
algorithm for QMC simulations in Sec. 5, we do not have to discuss classical cluster 
algorithms here; they have been described extensively in the literature (e.g., in [115] 
and [116]). It should be noted, however, that the Ising results to be discussed below, for 
systems with up to 1024^ spins, were in fact obtained using a cluster algorithm. It would 
be very difficult to generate (within reasonable time) data of the same quality using the 
Metropolis scheme for such large systems in the critical region. 



3.3. Finite-size scaling and critical exponents 

Fig. 13(a) shows the temperature dependence of the squared magnetization (m^) of 
the 2D Ising model for several lattice sizes. An increasingly sharp feature develops with 
increasing L in the neighborhood of the known Tc, and it appears very plausible that 
(m^) vanishes for — > oo for T > Tc and remains non-zero for 7 < 7^. In this plot, error 
bars are not visible because they are much smaller than the line widths. The data were 
generated on a very fine T-grid to produce continuous-looking curves. 

The size dependence of (m^) at three different temperatures is shown in Fig. 13(b). 
For T <Tc it converges with L to a non-zero value, while for T >Tc it decays to as L~^. 
Exactly at Tc the decay follows a non-trivial power- law; oc L-^P/^, where j8 = 1/8 
is the same exponent as in the N = oo magnetization for T < Tc, and v (the correlation- 
length exponent, to be discussed below) equals 1 for this model. This behavior is an 
example of finite-size scaling. A perfect power-law behavior at criticality holds strictly 
only when L — > oo, while for small systems there can be significant corrections to scaling 
(which are unusually small in the case of the Ising model discussed here). Normally we 
do not know the exact value of Tc (which may of course be one reason why a simulation 
is carried out), and procedures for locating the critical point then have to be devised. 
There are many ways to do this, all building in some way or another on the fact that 
the order parameter and related quantities should behave as non-trivial power laws (with 
known or unknown exponents) for large lattices at a critical point. 

Note that the squared magnetization is related to the spin correlation function, 

C{rij)^{GiGj), (49) 
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FIGURE 13. (a) The squared magnetization as a function of temperature for several LxL Ising systems, 
(b) The finite-size dependence at, below, and above Tc. At Tc the behavior is a power law, ^ L^'/^, 
which on this log-log scale corresponds to a line with slope —1/4. For T > Tc the decay to zero is of the 
form L^^, and this is also the rate of convergence for T < Tc- 



where r/y is the distance between the spins. If there is long-range order, then C(r) — )■ (m^) 
when r — 7- oo in an infinite lattice. For finite systems, the same is true for C(rniax) in the 
limit L — )■ oo, where r^ax is the longest distance on a periodic lattice, e.g., r^ax = 

y2Lfor 

an L X L lattice. If there is no order, then C(r) — )• 0, according to a power-law at and 
exponentially for T > Tc. Below Tc, in the infinite syztem the "connected" correlation 
function, 

C*{r)=C{r)-{m)\ (50) 

decays to zero exponentially as r — )■ oo. The exponential forms of both C(r) and C*(r) 
are characterized by a correlation length ^ , which diverges as T — i- Tc from either side. 
Note also that (m^) can be written exactly as a sum over the spin correlations; 

= ^lC(r). (51) 

The convergence of (m^) (both below and above Tc) is simply related to the finite 
correlation length. 

Here it should be pointed out that the nature of the spin correlations below Tc depends 
on the symmetry of the order parameter we are dealing with. For a vector order param- 
eter, e.g., in the Heisenberg model (with classical or quantum spins), the correlations of 
the spin components parallel (longitudinal) to the order parameter of a symmetry-broken 
system decay exponentially, as in the Ising model. The transverse correlations decay as 
a power law, however. This is related to the continuous symmetry of the order parameter. 
In the Ising model the symmetry is discrete — the ordered state breaks the spin-inversion 
symmetry and the free-energy cost of a local magnetization fluctuation is large (propor- 
tional to the boundary of a flipped domain). This leads to the exponentially decaying 
(connected) correlation function in the ordered phase. In the Heisenberg model, on the 
other hand, there are gapless spin- wave excitations (which are excitations of the spins in 
the directions transverse to the ordering direction and are more generally called Gold- 
stone modes). Fluctuations due to these lead to a power-law form of the transverse spin 
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correlations. The transverse correlation length is then formally infinite, and in a finite- 
lattice calculation in which the rotational symmetry is not explicitly broken (so that the 
computed (m^) contains contributions from both longitudinal and transverse correla- 
tions), the size corrections to {m^) are o« 1/L. We already discussed this behavior in the 
ground state of the 2D 5 = 1/2 Heisenberg antiferromagnet in Sec. 2.4, and Fig. 5(b) 
shows the 1/L corrections very clearly. 



3.3.1. Scaling and critical exponents 

To discuss finite-size scaling close to a critical point in more detail, we first have to 
review some basic aspects of critical phenomena in the thermodynamic limit. We will 
only list some of the key results and definitions here; see any standard text on critical 
phenomena (e.g., the book by Cardy [15]) for more details. 

The correlation length is one of the most important concepts underlying the theory of 
phase transitions and critical phenomena. In an infinite system, as the critical tempera- 
ture is approached, the correlation length diverges according to a power law; 

^^\t\-\ (52) 

The exponent v is the same upon approaching Tc from above or below, but the pref- 
actor in (52) is in general different for ? — )■ 0+ and ? — )■ (however, ratios of these 
prefactors — called amplitude ratios — are often universal). Within mean-field theory, 
V = 1/2 . Exactly at 71, although the correlation length is formally infinite, the sys- 
tem is not yet ordered. Instead, for an infinite system the correlation function exactly at 
Tc has the power-law form, 

C(r)~r-('^-2+n), (53) 

where d is the dimensionality of the system and 7] is another critical exponent (also 
called the anomalous dimension, because it can be related to the fractal dimensionality 
of ordered domains at the critical point), the mean-field value of which is T] = 0. Long- 
range order sets in only infinitesimally below Tc, where the asymptotic long-distance 
correlation approaches a constant; C(r ^ oo) = (m^). 

We have already discussed the behavior of the order parameter in the symmetry- 
broken state as ? — > 0~ ; 

(m) ~ \tf. (54) 
We will also be interested in the corresponding susceptibility, defined as 



dim) 

X = 



dh 



^^^ = j{{m^)-{\m\)^). (55) 



where h is the strength of a field coupling to the order parameter, e.g., in the case of 
the Ising ferromagnet a term —h'^Oi in the hamiltonian. The last expression in (55) 
shows explicitly how X is directly related to the fluctuations of the order parameter. The 
susceptibility diverges at Tc; 

X^\t\-l (56) 
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Thus, upon approaching a critical point, the system becomes infinitely sensitive to a field 
h coupling to the order parameter, and exactly at Tc the linear response form (m) = 
ceases to be valid. Instead, at T^, the order parameter depends on the (weak) field as h^/^. 
According to the result in Eq. (39), the mean-field value of the exponent here is 5 = 3. 
The specific heat is also singular at Tc, 

C~M~", (57) 

where a can be positive or negative. In mean-field theory a — 0. When oc < 0, there is 
no divergence, only a cusp singularity at Tc. In some cases, e.g., for the 2D Ising model, 
a = but there is still a weak, logarithmic divergence of the specific heat. 

The critical exponents v, /3 , T], etc., that we have encountered above are not all inde- 
pendent of each other. Relationships between the exponents are explained by the renor- 
malization group theory, which, in the type of order-disorder transitions we are discussed 
here, shows that there are two more fundamental exponents, in terms of which the physi- 
cally observable exponents can be written [15]. Exponent relations had been found using 
other arguments even before the advent of this ultimate theory of phase transitions (in 
the most complete form by Widom in the mid 1960s), e.g., 

7= v(2-77), 

vd = 2-a, (58) 
a + 2/3 + 7=2. 

Such relations are very useful for checking the consistency of numerical calculations 
of the exponents. Exponent relations involving the dimensionality d are called hyper- 
scaling relations and are less generic than the other relations. They are not applicable, 
e.g., within mean-field theory, and, therefore, for any system at or above the upper 
critical dimension. 



3.3.2. The finite-size scaling hypothesis 

The basic assumption underlying finite-size scaling theory [120] is that deviations 
from the infinite-size critical behavior should occur when the correlation length ^ (of 
the infinite system) becomes comparable with the finite system length L. If L » the 
fact that the system is finite should be irrelevant and the infinite-size behavior applies. 
On the other hand, if L <C , then L, not ^ , should be the most relevant length-scale. In 
order to see how the two length scales come into play, it is useful to express quantities 
of interest in terms of the correlation length. Consider a quantity Q which exhibits a 
power-law divergent behavior at Tc (reduced temperature t — > 0), 

!^~l^r^ (59) 

e.g., the susceptibility (in which case k — 7). We can use Eq. (52) to express \t\ as a 
function of the correlation length; 

\t\-^-'/\ (60) 
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and using this we can write Q as 

Q^^^/\ (61) 

This form should apply for ^ ^L, but when ~ L the divergence can no longer continue 
on the finite lattice. The maximum value Qmax attainable by Q on the finite lattice should 
then be obtained by replacing — > L in Eq. (61), giving 

emax(L)~L'^/^ (62) 

In the same way, from Eq. (60) we can also deduce the scaling of the reduced tempera- 
ture at which reaches L, which should also be the temperature at which the maximum 
value of the quantity Q is reached; 

\tmUL)\-^L-'/\ (63) 

It should be noted that this is just a proportionality, and the number in front of LT^/^ 
depends on the quantity. The shift also applies to non-divergent quantities — any feature 
which develops singular behavior as T ^ should shift at the rate L^/^. If there is a 
well-defined maximum or other distinguishable feature in some quantity at T = r*(L), 
then this temperature can be used as a size-dependent critical temperature (and, again, 
this temperature is not unique but depends on the quantity considered). 

The finite-size scaling laws (62) and (63) follow from a more general finite-size 
scaling hypothesis [120], which, like the scaling theory for infinite systems, was initially 
proposed based on phenomenological considerations, but was later derived using the 
renormalization group theory. The hypothesis is that an observable which is singular 
at Tc in the thermodynamic limit scales with the system size close to Tc as a power of 
L multiplied by a non-singular function of the ratio t,/L. Any singular quantity (not 
necessarily divergent) should thus be of the form 

Q{t,L)^L^f{^/L), (64) 

which, using ^ ~ we can also write as 

Q{t,L)=L^g{tL'l''). (65) 

This scaling law should hold both above {t > 0) and below (t < 0) the critical point. 
Exactly at Tc, we recover the size-scaling (2(0, L) ~ L^. To relate a to the standard 
critical exponents, we can use the fact that, for fixed t close to 0, as the system grows the 
behavior for any ? 7^ eventually has to be given by Eq. (59); 2(?,L— )-oo) ~ Irl"*^ (where 
K is negative for a singular non-divergent quantity, e.g., the for the order parameter we 
have JC = — j8). To obtain this form, the scaling function g{x) in (65) must asymptotically 
behave as g{x) ~ x^"^ for x ^ 00. in order for the size-dependence in (65) to cancel out, 
we therefore conclude that o — k/v, i.e., 

Q{t,L)=L-/''g{tL'/'). (66) 
To extract the scaling function g{x) using numerical data, one can define 

yL = Q{t,L)L-^/\ XL = tL'/\ (67) 
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FIGURE 14. Monte Carlo results for the susceptibility (55) of the Ising model on several different LxL 
lattices, (a) shows the temperature dependence, with the vertical line indicating 7^ . Note the vertical log 
scale. In (b) the data has been scaled using the exact values of the Ising exponents, 7 = 7/4 and V = 1, 
and the exact value of Tc in f = (T — Tc)/Tc. 



and plot yt versus xl for different system sizes. If the scaling hypothesis is correct, 
data for different (large) system sizes should fall onto the same curve, which then is 
the scaling function (this is referred to as curves collapsing onto each other); g{x) = 
yL^oo{x). Fig. 14 illustrates this using Monte Carlo data for the magnetic susceptibility 
of the 2D Ising model. The peak location in panel (a) clearly moves toward the known 
Tc with increasing L. After scaling the data according to the above procedures, as shown 
in panel (b), the curves indeed collapse almost onto each other close to ? = 0, but further 
away from the critical point deviations are seen for the smaller systems. These are due to 
corrections to scaling, which in principle can be described with subleading exponents. 

We can apply the scaling form (66) to the correlation length itself, for which K" = v and 
the L-scaling is independent of model- specific exponents. In cases where the universality 
class is not known a priori, this is useful for extracting the exponent v by curve- 
collapsing ^ /L data without having to simultaneously adjust another exponent K in (66). 

Practical definitions of the correlation length. The correlation length can be defined 
in various ways, not necessarily just based on the asymptotic decay of the correlation 
function (which is often difficult to extract reliably). One practical and commonly used 
correlation length definition is based on the Fourier transform of the correlation function, 
often called the (static) structure factor, 

5(q) = (a-qaq) = ^e-'''--C(r) = j;cos(q-r)C(r), (68) 

r r 

where Oq is the Fourier transform of an individual spin configuration, 

aq = -Lyajc''i'J. (69) 

We denote by Q the wave- vector of the dominant correlations — for a ferromagnet 2 = 0, 
for a 2D antiferromagnet Q = (;r, n), etc. To simplify the notation, q will be used for the 
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deviation from Q. Then qi = 2 ;r/L corresponds to one of the wave-vectors closest to Q, 
e.g., Q + {2k/L)x, where x is the reciprocal-space unit vector in the jc-direction. 
A correlation length can be defined using the structure factors at ^ = and qi; 




One can show that, for a J-dimensional lattice, if the correlation function is given by the 
Omstein-Zemike form (obtained in mean-field treatments [15]), 

Coz(r)~r-i(^-2)exp(-r/0, (71) 

then t,a is related to the original correlation length t, appearing in the exponential decay 
of this correlation function according to 



4. = 4^'ii±i^. p2, 

Thus, for J = 1 and 3, ^a = ^, whereas the 2D case is special, with = (^(15/16)^/^ 
(or, one may say that d — 1, 3 are special cases, since the factor differs from one also for 
all d > 3). The Omstein-Zemike form is normally valid in disordered phases for r » 
[15]. Deviations from this form at short distances imply that (72) is not true exactly, but 
regardless of the short-distance behavior this relation holds exactly when ^ oo. 

In the case of a long-range ordered classical system, t,a normally diverges as T — )■ 
0, for any L, because there are no fluctuations in the ground state (and, thus, the 
structure factor vanishes for q ^O.ln order to remove the contributions from the non- 
decaying part of the correlation function in an ordered system, we can use the connected 
correlation function (50). Although (m)^ is not uniquely defined for a finite system, one 
can, e.g., subtract C(rniax)- As an altemative, we can use a different correlation length 
definition, based on the structure factor at qi and q2 = 2qi= Ak/L; 



j S{qi)/S{q2)-l 
^iV4-5(^i)/S(^2)- ^ ^ 

One can show that (72) also holds for this definition if C(r) is given by the Omstein- 
Zemike correlation function, also when a constant corresponding to the long-range order 
is added for T <Tc [since this affects only 5(0)]. 

Fig. 15 shows results for ^a/L and ^b/L for the 2D Ising model. Note that a logarith- 
mic scale is used in the case of the T — )■ divergent ^a/L, while ^b/Lis convergent and 
graphed on a linear scale. Both quantities exhibit size independence (curves crossing 
each other) at Tc, but the values there are clearly different. This is because the Ornstein- 
Zemike form of the correlation function applies asymptotically only for T > Tc, and 
there is no reason why the two definitions and ^b should agree exactly at Tc (although 
their values should be related). Their values are very similar for large systems close to 
Tc in the disordered phase. The crossing points (or peak location of ^b) can be used to 
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FIGURE 15. Temperature dependence of the two correlation length definitions, Eqs. (70) and (73), 
normaUzed by the size L for L x L Ising models. The vertical lines indicate 71 . 



extract Tc in systems where it is not known. The temperature axis can also be scaled in 
the same way as in Fig. 14 to extract the correlation-length exponent. 

Note that for the small number of q-points needed, 5(q) can be efficiently evaluated 
using the Fourier transform (69) of the spin configurations generated in Monte Carlo 
simulations. Since the structure factor is real- valued we have 

5(q) = (Re{cTq}2) + (im{cTq}2). (74) 

Computing the full correlation function C(r) and Fourier-transforming it post-simulation 
is much more time-consuming. 



^2 = ^jK2- (75) 



Binder ratio and cumulant. Besides ^/L, there are also other dimensionless quanti- 
ties that are size-independent at the critical point and useful for extracting indepen- 
dently of the values of the critical exponent. The perhaps most frequently used one is 
the Binder ratio [121, 122]: 

M 

At Tc, the power laws cancel out, and the ratio is L-independent (and also universal), up 
to subleading finite-size corrections. Normally, graphing R versus T for different system 
sizes produces curves that intersect each other close to Tc. Locating the points where R2 
for pairs of system sizes (e.g., L and 2L) cross each other, one obtains a size dependent 
critical point which typically converges faster than the L^^/^ shift in (63). One can think 
of this as being a results of the leading corrections canceling in a quantity involving two 
system sizes, and one is then left with something which approaches Tc according to a 
faster, higher-order scaling correction. One can also define ratios similar to (75) based 
on other powers of m, e.g., Ri = (m^) /(|m|). The curve-crossing method for locating Tc 
can also be applied in the same way with <^/L. 

The Binder ratio also has other interesting properties. In the case of a scalar order 
parameter (e.g., for the Ising model), the Binder cumulant is defined as 

U2 = 1(\-\r^. (76) 



2 V 3 
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FIGURE 16. The Binder cumulant (76) for L x L Ising models. In (a) on can see the approach to the 
limiting values U2 -^0 (for T > Tc) and C/2 ~> 1 (for T < Tc) for increasing L. In (b) the data close to Tc 
(vertical line) are graphed on a more detailed scale and for larger L to show the crossings of the curves. 



In an ordered state, U2 ^ I {or N ^ 00, because the magnetization distribution P{m) 
then approaches two delta-functions at ±(|m|), and, therefore, R2—^ l.ln contrast, in a 
disordered phase, the fluctuations of m are Gaussian around m = (following from the 
central limit theorem, since fluctuations in regions separated by distance ^ in a large 
system are uncorrelated). Based on Gaussian integrals, 7?2 — ^ 3 and U2 — )■ 0. 

Generalizing the Binder cumulant to an n-component order parameter (where n = I 
for the Ising magnetization, n = 2 for the XY model, etc.), one should keep in mind that 
= m ■ m and in (75) are insensitive to angular fluctuations of the order parameter. 
Integrating a Gaussian distribution of |m| over the n-dimensional space to compute the 
averages in (75) does, however, introduce n-dependent factors. To reproduce the above 
properties of U2 one therefore has to define it for a general order parameter as 

Fig. 16 shows Monte Carlo results for U2 as a function of T for several 2D lattices. The 
evolution into a step-function at Tc with increasing system size can be seen clearly. In 
this case all the curves cross each other very close to the known Tc, reflecting very small 
subleading corrections in this model. It is common in other systems that the crossings 
exhibit some drift, and a careful extrapolation of crossing points have to be carried out 
(e.g., based on data sets for sizes L and 2L, or some other aspect ratio). 

Finite-size scaling in practice. We briefly discuss how to carry out finite-size data- 
collapse in practice. The number of parameters involved (i.e., Tc as well as one or two 
exponents, and possibly also exponents of subleading corrections to be discussed below) 
is rather small, and normally one has some rough knowledge of their values just from 
looking at raw data and doing some initial experimentation, e.g., by just locating a non- 
trivial power-law behavior as in Fig. 13 [and this may often be enough to determine the 
exponent ratio k/v in Eqs. (62) and (66)]. An analysis of the Binder cumulant or ^ /L 
may already have given Tc to adequate precision, but it may still be useful to check the 



Computational Studies of Quantum Spin Systems 



January 18, 2011 



40 




0.0 0.2 0.4 

1/v 



2x10 



1x10" 



>~1 



-1x10" 



-2x10" 



^ 



(b) 



x=tL 



-0.4 -0.2 0.2 0.4 
1/v 

x=tL 



FIGURE 17. (a) Best-fit scaling collapse of the susceptibility of the 2D Ising model; the same data 
as in Fig. 14, but including larger lattices and adjusting 7^ in t — {T — Tc)/Tc as well as the exponents 
V,7 to minimize with respect to a scaling function g{x) in the form of a polynomial (here of fourth 
order, shown as the solid curve). The optimal values for this data set are: T^/J = 2.26921 ±0.00002, 
V — 0.9985 ± 0.001 1, and y— 1.750 ±0.002, where the error bars (one standard deviation) were computed 
by repeating the fit several times with Gaussian noise (of magnitude equal to the Monte Carlo error bars) 
added to the data, (b) shows the data with the fitted scaling function g{x) subtracted, so that the error bars 
become visible. The fit is statistically sound, with ~ 0-9 (per degree of freedom). 



sensitivity of other fits to its value. Thanks to the power of modern computers, as an 
alternative to using some complicated multi-dimensional optimization procedure, one 
can write a simple brute-force computer program to search for the best set of parameters 
on a suitable finite mesh. The goodness of the data collapse produced by a set of 
parameters can be quantified as the ;f^-value obtained by fitting a single high-order 
polynomial through all the scaled data points {xi^yi), defined in (67), simultaneously 
for all L for which data are available. Normally there is a large number of data points, 
for different couplings and system sizes, and, since the scaling function should be well 
behaved, a polynomial of reasonable order (3th-8th, as a rough guideline) should work 
well within the window where the data can be collapsed. The size of this window also 
has to be adjusted until the data collapse well. As an example, for the data in Fig. 14, 
X in the range (—0.5,0.5) should be appropriate, although the window also depends on 
the system sizes included in the analysis and the the error bars (which determine the 
sensitivity to neglected subleading scaling corrections). 

It is not always easy to determine reliable error bars for parameters obtained in this 
kind of fitting. Beyond the purely statistical errors (which can be determined, e.g., by 
repeating the x'^ minimization several times with Gaussian noise, of magnitude equal 
to the error bars, added to the data), there are also systematical errors due to scaling 
corrections, which can be difficult to estimate. If is statistically reasonable (i.e., close 
to 1 per degree of freedom), one would normally assume that the neglected corrections 
have not influenced the parameters beyond the statistical uncertainties. 

Fig. 17 shows an example of data collapse. The critical exponents and Tc of the 2D 
Ising model were determined using data for L G {64 — 512} in the window |? |L^/^ < 0.5 
(where t contains the variable Tc adjusted in the procedure). The results, listed in the 
figure caption, are in excellent agreement with the known and the 2D Ising exponents, 
and the fit is also statistically very good. Using the same fitting window, a statistically 
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acceptable fit cannot be obtained if much smaller lattices are included, e.g., including 
L — 32 gives 2 (per degree of freedom), which is marginally too large for the 
number of degrees of freedom of the fit. In this case Tc = 2.26924 ± 0.00002, about 3 
error bars above the true value, while the exponent are correct to within two error bars. 

Corrections to scaling . As a check, it is often a good idea to include some correc- 
tions to the leading scaling forms, and some times it is even necessary to do so in order 
to obtain good fits. The most commonly used method is to do the data collapse with 

jL = Q{t,L)L-^l\\+aL-'')-\ XL = tL^I\ (78) 

where CO is a subleading exponent and a is a constant. Here there is no correction to 
the xi scaling (the finite-size shift of the critical point), but such a correction can in 
principle be included as well [85]. Normally, good fits with corrections can be obtained 
also when lattice sizes are included that have to be left out if no corrections are used. 
A larger range of system sizes can partially compensate for the fact that the statistical 
uncertainties of all parameters increase when more parameters are included. If consistent 
leading critical exponents are obtained in fits both with and without corrections, then one 
can be reasonably certain that the results are correct. 

3.4. First-order transitions 

The scaling properties we discussed in the preceding section apply at continuous 
phase transitions, where the correlation length diverges. At first-order (discontinuous) 
transitions, the correlation length remains finite at the transition point and the order 
parameter, as well as other quantities, exhibit discontinuous jumps. The discontinuities 
develop in the limit of infinite system size, normally according to power-laws which 
can also be studied using finite-size scaling techniques. The exponents associated with 
these powers-laws are typically trivially related to the dimensionality of the system 
[123, 124, 125]. For instance, the specific heat diverges with the system size as at 
a first-order transition, instead of L"/^, with a typically very small (or even negative) a, 
at a continuous transition. The shift of the critical point with the system size scales as 
L~'^, instead of the Lr^/^ shift of a continuous critical point. 

Although finite-size scaling with exponents equal to d in principle makes it easy to 
recognize a first-order transition, studies of weak first-order transitions are difficult, be- 
cause they exhibit large corrections to the leading scaling forms. It may then be difficult, 
with system sizes accessible in practice, to clearly distinguish slowly developing discon- 
tinuities from weaker singular behavior at a continuous transition. 

Strongly first-order transitions are also difficult to study, for a completely different 
reason. A Monte Carlo simulation may get stuck in a meta-stable state, in which case 
computed quantities do not correspond to correct thermal averages (which, on the other 
hand, is completely analogous to real systems, for which metastability and hysteresis ef- 
fects are hall-marks of first-order transitions). It may then even be difficult to accurately 
locate the transition point. To alleviate such problems, various multi-canonical Monte 
Carlo methods have been developed in which the configurations are sampled in an ex- 
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tended ensemble where the temperature of the system is also fluctuating (tempering or 
parallel tempering [126, 127] methods), or with a distribution different from the Boltz- 
mann probability [128, 129, 130] (to which the measurements are re-weighted), which 
makes it easier for the system to explore the configuration space. 



3.4.1. Phase coexistence 

One of the most important characteristics of a first-order transition is phase coexis- 
tence. In Monte Carlo simulations at the transition point (in practice in a small window 
which shrinks to a single temperature with increasing system size) this is manifested in 
the generation of two types of configurations, corresponding to the two distinct phases 
existing just above and below the transition temperature. This is provided that the full 
configuration space can be ergodically sampled, which, as discussed above, is not always 
possible in practice. In small systems, there will also be configurations that cannot be 
clearly associated with one of the phases — they correspond to the fluctuations (domain 
walls) required for the system to transition between the two phases. Fig. 18 illustrates 
this schematically for an Ising order parameter. The transition window in which a three- 
peak distribution can be observed narrows rapidly with increasing system size, and the 
peaks develop into delta-functions. When the weight between the peaks becomes very 
small, it may in practice not be possible to ergodically sample the configurations, as the 
system gets trapped within the sub-space corresponding to just one of the peaks (in a 
way exactly analogous to the symmetry-breaking discussed in Sec. 3.2). 

Note that the type of order-parameter distribution in Fig. 18 does not apply to the field- 
driven first-order transition of the Ising model, with magnetization curves illustrated in 
Fig. 10. In that case there would be just two peaks, with weight transferring between 
them as the field is tuned through h — 0. The phase coexistence at h — (and T < Tc) 
corresponds to the two symmetric peaks in the distributions in Fig. 12. The similarity 
between this case and a paramagnetic-magnetic transition (with a general n-component 
order parameter m) can be made clearer by considering the one-dimensional distribution 
of |m|. This distribution has two peaks, one at |m| = and one at |m| > 0, when an 
ordered and disordered phase coexist. Note again that in continuous transitions (Fig. 12) 
the central peak continuously splits into two peaks below Tc, in contrast to the two 
ordering peaks emerging at a non-zero value of |m| in the first-order case. 

The Binder ratio, Eq. (77) in the case of a generic n-dimensional order parameter 
depends only on the distribution of |m| . It has a very interesting property at a first-order 
transition. One can easily check, using, e.g., an idealized co-existence distribution of the 
type P{\m\) = (1 — p)G{\m\) + pd{\m\ — m*), where G is a normalized "half-Gaussian" 
(defined for |m| > 0) and < m* < 1, that U2 exhibits a negative divergence when the 
ordered-phase weight p is tuned from to 1, and if the width of the Gaussian vanishes 
(corresponding to infinite system size). While the full m distribution of course contains 
more information, locating a window of negative Binder cumulant and checking for a 
divergence of the peak value can be a practical way to analyze a first-order transition. 
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FIGURE 18. Evolution (schematic) of the magnetization distribution of a finite Ising ferromagnet close 
to a first-order transition. There is a rapid transfer of weight between the central peak (corresponding to 
the disordered phase) and the two peaks at non-zero magnetization (the two ferromagnetic states) as the 
temperature is tuned through the transition region. The peaks become delta-functions in the limit of infinite 
system size. The transition temperature Tc can be defined as a point with some specific (but essentially 
arbitrary) feature of the three peaks, e.g., equal weight or equal height of the peaks. 



3.4.2. Frustrated Ising model 

Here we look at a particular example of a first-order transition, in the frustrated 2D 
square-lattice Ising model with hamiltonian 

Ea = -J\ + L (79) 

('7>2 

where both coupUngs /i ,72 > (but note the different signs in front of the parameters). 

The first term (where (/j) j denotes nearest-neighbor spins) is then the standard Ising fer- 
romagnet, whereas the second term (where {ii)2 refers next-nearest-neighbor spins, i.e., 
across the diagonals on 2 x 2 plaquettes, as in Fig. 7) is antiferromagnetic and causes 
frustration. For coupling ratios g = J2/J1 < 1/2 the ground state of the system is fer- 
romagnetic (fully polarized). In the limit g ^ °° the system reduces to two decoupled 
antiferromagnets, with four striped (or coUinear) ground states, such as the one illus- 
trated in Fig. 7(c). These remain the ground states for all g > 1/2. At the point g — 1/2 
the ground state is massively degenerate. Note that the case of /i < is equivalent to 
J I > 0, through the invariance of Ea under the transformation a, — )■ — (T,- on one of the 
checker-board sublattices; thus in this case the ferromagnetic phase is replaced by an 
antiferromagnetic one, whereas the striped phase is not changed. 

The frustrated Ising model has been studied for a long time, but some features of its 
phase diagram are still debates or were resolved only recently [95, 96, 97]. For ^ < 1/2, 
the transition is of the standard Ising type, but is difficult to study close to ^ = 1/2, 
due to large scaling corrections and long Monte Carlo autocorrelation times. Here we 
will consider only g > 1/2, for which there is a first-order transition between a high- 
temperature paramagnet and a low-temperature striped phase, up to a coupling ^* ^ 0.8 
[97] after which the transition becomes continuous. We analyze results obtained with 
the standard single-spin Metropolis algorithm (as cluster Monte Carlo methods do not 
work in the presence of frustration). Temperatures will be quoted in units of Ji. 



Computational Studies of Quantum Spin Systems 



January 18, 2011 



44 



T=0,771 



T=0.772 



T=0.773 



FIGURE 19. Order parameter distributions in the {mx,my) plane (in the full space |m, | < 1, \my\ < 1) 
of the 2D frustrated Ising model of size L = 128 at coupling ratio g = 0.55. Brighter features correspond 
to higher probability density. The temperatures (indicated in the panels in units of Ji) are in the first-order 
transition region, with the phase coexistence (a central peak as well as four peaks corresponding to x- and 
);-oriented stripes) seen most clearly at g = 0.772. 

Order-parameter distribution. The ordered phase can have its stripes oriented either 
along the x or j-axis, with the corresponding order parameters 

= 77 E ^'(-1)"' ^y = -^t Oi{-\)y^, (80) 

where Xi and ji are the (integer) lattice coordinates of site i. Let us first look at the 
order-parameter distribution P{mx,my). If the four possible ordered states are sampled 
equally below Tc, the distribution should be four-fold symmetric, with peaks located 
on the negative and positive x- and y-axis. In the paramagnetic phase there should be a 
single central peak. Coexistence at a first-order transition should hence be reflected in the 
presence of five peaks in this case (instead of the 3-peak distribution for the scalar order 
parameter in Fig. 18). Fig. 19 shows results for an L = 128 system at ^ = 0.55, for three 
temperatures in the transition window. A very distinct four-fold symmetry can be seen in 
all cases. Four symmetric peaks clearly indicate an ordered phase at T = 0.771, but there 
are strong fluctuations, reflected in weight extending to the center of the distribution. At 
T = 0.773, the distribution is peaked at the center, but weight also extends far into the 
ordered regions. Between these temperatures, at T = 0.772, one can discern both the four 
ordering peaks and a central peak. All these plots show the hall-marks of coexistence; 
even when there are not five peaks present, there is still significant probability for both 
ordered and paramagnetic configurations. Away from the narrow transition window, the 
distribution rapidly turns into one with either distinctly ordered or disordered features. 

Binder cumulant. Let us analyze the Binder cumulant, using = + in 
Eq. (77). The question now is what order-parameter dimensionality n to use in this 
definition. At high temperatures, fluctuations of and my in different parts of a large 
system are uncorrelated. This implies a rotationally-symmetric (circular) distribution 
P{mx, my) . The n-dependent factors are intended to make U2^0forT^ oo, and in this 
case we should therefore use n = 2. This is correct also at low temperatures (to guarantee 
f/2 — ^ 1)5 because in an ordered phase the Binder ratio, Eq. (76), R2 I when the 
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FIGURE 20. Binder cumulant across the transition to the striped phase of the frustrated Ising model 
for coupHng ratios g = 0.55 (left) and g — 0.70 (right). The divergent negative peak developing with 
increasing L is an unambiguous signal of a first-order transition. 



system size diverges, regardless of the order parameter structure. Fig. 20 shows results 
as a function of temperature at two coupling ratios; g = 0.55 and 0.70. For g = 0.55, 
a negative Binder cumulant can be seen for L > 8, with the negative peaks becoming 
very narrow and apparently diverging as the system size is increased. At ^ = 0.70, 
the transition is still first-order, but with weaker discontinuities that start to manifest 
themselves as co-existence and a negative Binder cumulant only around size L = 32. 
In both cases, the Monte Carlo simulations were still ergodic, but a large number of 
updating sweeps (~ 10^ for each case) had to be carried out to obtain smooth curves. 

Discontinuities and finite-size scaling. In first-order transitions that are not very 
strong, as in the systems above, it may not be easy to accurately extract the size of 
discontinuities in physical observables. An example is shown in Fig. 21, where the 
temperature dependence of the internal energy is graphed for several lattice sizes. While 
a discontinuity (latent heat) clearly develops with increasing L, it is not easy to determine 
exactly between which two energies the jump will eventually take place. This requires a 
careful analysis using larger lattices. 

Fig. 21 also shows the maximum value of the specific heat versus the system size for 
two different coupling ratios. At ^ = 0.51, the transition is rather strongly first-order, 
and a scaling consistent with the expected Cmax ~ can be observed for the largest 
lattices. In contrast, at ^ = 0.55 the behavior appears to follow a different power law 
(with an exponent close to 1.2) up to L !^ 64. For larger lattices the results start to show 
a somewhat more rapid divergence, however, and eventually, for very large lattices, one 
would expect the exponent to be equal to J = 2 also in this case. 



3.5. Spin stiffness and the Kosterliz-Thouless transition 

An important aspect of a system with long-range magnetic order is that it exhibits a 
non-zero spin stiffness. For a system with continuous vector spins (XY or Heisenberg 



Computational Studies of Quantum Spin Systems 



January 18,2011 



46 



-0.7 

-0.8 

lil -0.9 

-1.0 

-1.1 


FIGURE 21. Left: Internal energy versus temperature for the frustrated Ising model at ^ = 0.55. The 
discontinuity developing with increasing lattice size corresponds to the latent heat. Right: Finite-size 
scaling of the peak value of the specific heat a.tg = 0.5 1 and 0.55. The line shows the expected asymptotic 
scaling at a first-order transition. 



models), the spin stiffness is the analogue of an elastic modulus of a solid. It is also often 
refereed to as the helicity modulus [131]. To study this concept, we here consider the XY 
model, with the hamiltonian written as 

// = -7£cos(0/-0^), (81) 

where 0,- G [0,2;r] is the angle characterizing spin /. For simplicity we assume only 
nearest-neighbor interactions, but generalizations to arbitrary interactions are straight- 
forward. When deriving an expression for the spin stiffness, we will consider the 2D 
square lattice (and some times use a ID chain for simplicity), but later will study 
also a 3D cubic system. In 2D XY systems, the spin stiffness is an important quantity 
characterizing the unconventional (topological) Kosterlitz-Thouless transition exhibited 
by this system, which we also discuss briefly in this section. 

3.5.1. Definition of the spin stiffness 

Loosely speaking, the spin stiffness Ps characterizes the tendency of ordered spins to 
adapt in response to perturbations imposing modulations of the direction of the order 
parameter (in contrast to the susceptibility, which measures the tendency of the order 
parameter to change in response to a field applied in a fixed direction). It is analogous to 
the shear modulus in continuum mechanics, which characterizes the tendency to shape 
deformation of an elastic object (while the compressibility corresonds to the tendency to 
volume change with maintained shape). The definition of the spin stiffness is easiest to 
understand at T = 0, which we consider first, before generalizing to T > 0. 

The stiffness at T = 0. We will first consider a system with open boundaries in the 
X direction, while periodic boundaries in the y direction can be assumed (and later we 
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FIGURE 22. A 2D classical XY model with a phase twist in the x-direction imposed by fixing the spins 
in the boundary column at a relative angle <I>. To minimize the energy the total twist <I> is distributed 
evenly, so that the spins in neighboring columns are twisted hy (j) — O/L^ relative to each other. 



will generalize to periodic boundaries also in the x direction). Fig. 22 illustrates how 
ferromagnetic XY spins at T = adapt in order to minimize the energy when an over-all 
change in the spin angle 4> between the left and right boundaries is imposed (e.g., due to 
strong magnetic fields applied at the boundary columns). To minimize the energy, each 
column is twisted with respect to the following column by an angle ^ = 4>/ (L^ — 1), 
where is the length of the system in the x direction (the number of columns). At 
r = 0, we are interested in the energy in the presence of this twist, which for the 2D XY 
model is simply given by 

£(0) = ~J{L,-l)LyCos{(j)) = E{0)+J{L^-l)Ly[l-cos{(l))]. (82) 

For small we get E{(j)) —E{0) = {J/2){L^— 1)^^^ to leading-order. Motivated by 
this result, the T = spin stiffness is defined as 

P'-N^^' ^^^^ 

and we have = J for large A'^ [or for any A'^ if we normalize by the number Ly{Lx — 1) 
of interacting x bonds, over which the energy cost due to the twist is distributed]. 

Normally it is more convenient to consider a periodic system. To derive an expression 
for the spin stiffness in this case, a phase twist is first imposed at the boundary. For 
simplicity we work this out for a ID chain, with spin angles 0;^, x = Q,. . . ,N —I, but the 
calculation can be trivially generalized to higher dimensionality. The interaction energy 
for each bond is = —7 cos (0^+1 — ®x), except at the boundary, where a twist ^» 
corresponds to E^ = — 7cos(0o — 0a?-i +^). The configurations minimizing the energy 
have 0X+1 — 0x = 5, where 5 is independent of x, which gives the total energy 

E{^) = ~{N- l)cos(5) -cos(4)- [A^- 1]5), (84) 

which is minimized by 5 = (j) = ^/N, with£'(0) = E{0) +JN[l — cos(0)], and the spin 
stiffness defined according to (83) is again ps = J. 

It is useful to consider also another way of twisting the spins in a periodic system, 
by introducing a twist field 4>x = x^ in the hamiltonian. For a ID system the energy for 
each bond in the presence of this field is —7 cos (0^+1 — 0x + ^x+i — ^x) • To treat the 
boundary correctly, should be considered as a function of a continuous variable x. 
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which jumps discontinuously from to at ^ = A/^. The phase difference appearing in 
the XY interaction should then be interpreted as 

rx+l rx+l 

4>;,+i-^>;,= / ^xdx= / (j)dx = (l), (85) 

Jx Jx 

which holds also at the boundary {x + 1 = N). Apart from factors, is analogous to a 
flux threading the ring. The energy in the presence of this twist is minimized for &x — & 
independently of x, giving E{^) = E{0) +JN[l — cos(0)] as before in the case of the 
twisted boundary condition. One can consider the twist field as a way of transforming 
away the twisted boundary condition (rotating to the reference frame of the spins in the 
presence of the boundary twist), which makes many calculations easier in practice. We 
will work with the twist field from now on. 



The stiffness at T > 0. Now we consider non-zero temperatures, in which case the 
spin stiffness is defined as 

1 d^F{(j)) 

where F(0) is the free energy in the presence of a twist field (or, equivalently, a twisted 
boundary condition), which in turn is related to the partition function according to 



F(0) = -lln[Z(0)]. (87) 

For r — )• 0, Eq. (86) clearly reduces to the ground-state energy definition (83). 

Applying a twist field which only depends on x, ^{x,y) = x<j), in the 2D XY model, 
we have the hamiltonian 

^(0) = --^ L cos(0 + Sj - Si) -JY, cos(0y - (88) 

{iJ}x {i,j)y 

where we can assume periodic boundary conditions in both directions (although in 
principle the y boundary could be open). We can simplify the dependence on by using 
a standard trigonometric equality, 

cos(0 + @j - @i) = cos(0y - 0;) cos(0) - sin(0y - 0,) sin(0), (89) 

which we expand to second order in : 

cos((^ + - 0,) cos{Qj - - (j)^) - sin(0y - 0,)0 + ^(0^). (90) 

The hamiltonian in the presence of a small (<j> — > 0) twist can then be written as 

Hi(l>) ^ HiO) + \(^^Hx - (91) 

where is the jc-bond part of the hamiltonian at = and 4 is the total spin "current" 
in the x lattice direction: 

= sin(0,--0O. (92) 
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The partition function in the presence of a small twist can then be written as 

Z(0) = yj[0]e-^^W (93) 
^ |j[0]e-^^(o)[l-ij802/f_^+...][l + j804+i(j80/,)2 + ...], 

where the exponentials involving H^^ and have been Taylor expanded to the orders 
needed. We can now write this in a form with expectation values over the distribution 
for = 0. To second order in : 

Z(0) ^ Z(0)[1 - \fi(^^{H,) +j80(/,) + ij8202(72)]. (94) 

By symmetry {Ix) = and the free energy (87) is given by 

F{^)=F{Q) + \<^\{Hx)-fi{ll)), (95) 

and from this we can extract a simple expression for the spin stiffness (86): 

ps=]^m)-fi{il)), (96) 

which can be evaluated using Monte Carlo simulations. This result is for a twist field in 
the X lattice direction. For a J-dimensional isotropic system, we can average over all the 
equivalent directions and write the spin stiffness as 

where the index a corresponds to the current in lattice direction a. For an anisotropic 
J-dimensional system, the stiffness in general is different for all lattice directions, i.e., 
there are d different stiffness constants, each of them given by a form like (96). 

The twist-field definition of the stiffness can be easily generalized to systems with 
longer-range interactions. The forms (96) and (97) remain valid, with the current Ix 
containing contributions from all interactions exactly as in the hamiltonian. 

Relation to superfluidity and superconductivity. A very interesting aspect of the spin 
stiffness of the XY model is that it can be directly related to the superfluid density of a 
superconductor (or a superfluid such as ^He) [131, 132]. Like the magnetization of the 
XY model, the order parameter of a superconductor or a superfluid is i7(l) symmetric 
(corresponding to the global phase of the wave function), and the twist field we discussed 
above is directly analogous to a magnetic flux (in the case of a superconductor). A non- 
zero stiffness corresponds to the Meissner effect exhibited by a superconductor. Monte 
Carlo simulations of the 2D XY model directly aimed at properties of thin superfluid 
films are discussed in Refs. [133, 134]. 
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FIGURE 23. (a) Temperature dependence of the spin stiffness of the 3D XY model for different lattices 
of size N^L\ (b) Finite -size scaling according to Eq. (99). Curves of Lps for different L cross each other 
(asymptotically for large L) at the critical temperature. These results were obtained using a Monte Carlo 
cluster algorithm [119], and in all cases the error bars are too small to be visible. 



Spin stiffness scaling. The critical scaling properties of the spin (or superfluid) 
stiffness were first worked out in the context of superfluids [131, 132]. In the infinite 
J-dimensional system with J > 2, it was shown that 

p,~(r,-r)('^-2)v^ (98) 

for T —fTc from below. Here v is the standard correlation length exponent. According to 
the general finite-size scaling relation (66), the size dependence of the stiffness exactly 
at Tc is then given by 

ps ~ L^^'^. (99) 

The stiffness is therefore, like the correlation length and the Binder ratio, a useful 
quantity for locating the critical point without having to adjust any unknown (or not 
precisely known) exponents. This is illustrated with Monte Carlo results for the 3D XY 
model in Fig. 23, where Lp^ is size independent at the critical point. 



3.5.2. Kosterliz-Thouless transition in two dimensions 

In the 2D XY model there can be no transition into a phase with long-range magnetic 
order at T > 0, according to the Mermin-Wagner theorem [38] (as discussed in Sec. 2). 
Remarkably, this system exhibits a different kind of phase transition, where no long- 
range order develops but the spin correlations change from exponentially decaying to 
a power-law form [135, 136]. This Kosterlitz-Thouless (KT) transition is topological in 
nature, being a consequence of proliferation of unbound vortices (which are topological 
defects) in the spin configurations at temperatures T > Tkj. For T < Tkt, vortices 
also exist (at a density which vanishes at T — )■ 0) but they are all bound in vortex- 
anti vortex pairs, which have no net vorticity (and therefore vanish upon course graining 
of the spins). The power-law form of the spin correlations, C(r) ~ r^^, applies for all 
< r < Tki- The exponent rj is temperature dependent, with the value rj = 1/4 exactly 
at Tkt and rj — > as T — )• 0, so that true long-range order exists at T = 0. 
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Although there is no long rage order for < T < Tkt, the spin stiffness is actually 
non-zero in the KT phase — power-law correlations with a sufficiently small exponent are 
enough to support an energy cost of a boundary twist. There is no power-law onset of the 
stiffness, but instead an even more prominent signal of the transition; a discontinuous 
jump at Tkt from ps = to a non-zero value. Renormalization- group calculations for 
the continuum field theory corresponding to the 2D XY model have given very detailed 
information about the KT transition. One of the most important results is a rigorous 
relationship (due to Nelson and Kosterliz [139]) between the transition temperature Tkt 
and the spin stiffness exactly at this temperature (i.e., the size of the discontinuity); 

Ps{Tkt) = (100) 
n 

For finite lattices, the stiffness at Tkx approaches the infinite-size value with a logarith- 
mic size correction [138]: 

ft(7-KT,L)=ft(rKT,~)(l+^^^). (101) 

where c is a system dependent parameter. Thus, one can say that the general finite-size 
scaling law (99) for Ps at a critical point holds, but with a logarithmic correction to the 
leading-order size-independent form obtaining when d = 2. 

Fig. 24(a) shows Monte Carlo results for the stiffness of the 2D XY model versus 
temperature. The jump expected in the thermodynamic limit is approached very slowly 
as a function of size. This can be related to the fact that the correlation length for T > Tkt 
does not diverge as a power-law, but according to the exponential form 



«/(7'-7kT)l/2^ (102) 



where a depends on details of the lattice and the interactions. Therefore, using the same 
arguments as we in the case of a standard critical point in Sec. 3.3.2, the finite-size shift 
of Tkt [defined using, e.g., the temperature T*{L) at which drops the most rapidly] is 
given by the form, 

r*(L)-rKT (103) 

^ ^ ln2(L)' 

which is much slower than the conventional power-law shift T*{L) — ~ L^/^. 

Using the standard finite-size scaling hypothesis (64) and replacing (the size 
dependence of the singular quantity exactly at the critical point) by the logarithmic size 
correction in (101), we can write a hypothesis for the size and temperature dependence 
of the spin stiffness at the KT transition as 

which, after taking the logarithm of the argument of the scaling function f{x), can be 
written in terms of another function g[ln(.x:)] as 

^^^^'^^ (^ + 21n(L)+c ) =s[HL)-a/{T-T^Ty/\ (105) 
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FIGURE 24. (a) Monte Carlo results (obtained with a cluster algorithm [119]) for the temperature 
dependence of the spin stiffness of the 2D XY model for several lattices of size L — 2". A discontinuity 
develops at Tkt as L oq, as indicated with the vertical line at the known transition temperature (7kt ~ 
0.8933 [137]). According to the Nelson-Kosterlitz relation (100), the stiffness exactly at 7kt, for any 
system exhibiting a KT transition, must fall on the line shown; p.s ^ 2T/n. (b) Finite-size data collapse 
according to the combined {T,L) scaling hypothesis (105), using the known Tkt = 0.8933 and with the 
two parameters, a = 1.5 and c = 0.7, chosen to obtain good data collapse. The vertical line shows the 
asymptotic T — > Tkt, °o value expected according to the Nelson-Kosterlitz relation. 



Here T — )■ Tkt corresponds to the argument z = ln(L) —a/{T — Tkt) 

The KT transition temperature of the 2D XY model has been extracted in different 
ways in many studies, e.g., in [133, 137]. Fig. 24(b) shows a test of the scaling form 
(105), using Tkt = 0.8933, as obtained in [137], and with the constants a and c adjusted 
to obtain the (approximately) best data collapse onto a common scaling function for sys- 
tem sizes L = 8, 16, . . . , 128. The data are indeed well described by this form. This kind 
of plot confirms three different aspects of the KT transition (which were theoretically 
deduced at different stages of the history of the KT transition) at the same time; the 
exponentially divergent correlation length (102) [135, 136], the logarithmic correction 
(101) [138], and the Nelson-Kosterlitz relation (100) [139]. 



3.6. Quantum Phase Transitions 



In the following sections of these notes, we will study several examples of quantum 
phase transitions, which take place in the ground state of a system as a function of some 
model parameter [1]. The scaling properties we have discussed above for classical con- 
tinuous and first-order transitions still apply, with some important extensions and modi- 
fications. The nature of a classical thermal phase transition can be traced to singularities 
in the free energy. At a quantum phase transition, it is instead the ground state energy 
which exhibits singular behavior, which is manifested also in other quantities. This can 
be understood as arising from the T >0 free energy of the quantum system, which when 
r — > becomes the ground state energy. That is of course also true classically, but the 
ground states of classical models normally do not evolve continuously as a function of 
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the parameters (as we saw in the example of the frustrated Ising model in Sec. 3.4.2) 
and are therefore strongly first-order. In contrast, as already discussed in Sec. 2.4, quan- 
tum systems have non-trivial ground states, with zero-point fluctuations that evolve as 
the parameters are varied. Continuous phase transitions driven by divergent quantum 
fluctuations are common. 

As we will see in connection with quantum Monte Carlo methods in Sec. 5, a d- 
dimensional quantum system can formally be mapped, using path integrals, onto an 
equivalent classical statistical-mechanics problem in J + 1 dimensions (albeit some 
times with a non-positive-definite distribution function). The size of the system in the 
new "imaginary time" dimension is the inverse temperature; Lx — c/T, where c is a ve- 
locity. At r > 0, this dimension is finite, while the spatial dimensions can be infinite. The 
system is then J-dimensional with a "thickness" L^. The strict {d+ 1) -dimensionality 
applies when also T — )■ 0. In that case, a tunable parameter in the hamiltonian can play 
a role very similar to the temperature in a classical system. Interestingly, changing the 
temperature in this case is analogous to finite-size scaling in [5], which can be used 
to deduce finite-temperature scaling properties close to quantum-critical points [84]. 

In some cases, the J-dimensional quantum system has the low-energy properties of the 
same kind of classical system md+ \ dimensions. This is the case, e.g., for the dimerized 
Heisenberg models discussed in Sec. 2.4. The low-energy physics of these models can 
be mapped onto a 3D classical Heisenberg model, which is normally done via continuum 
field theories, such as the nonlinear a-model [5, 84] (and, it, should be noted, the 
dimerization is only a means of tuning the strength of the quantum fluctuations, which in 
a course-grained uniform effective model is just represented by the coupling constant of 
a field theory, or the temperature in an effective 3D uniform classical model [140]). One 
can then presume that the quantum phase transition driven by tuning the dimerization 
strength should be in the universality class of the temperature-driven transition of the 
3D classical Heisenberg model. In other cases, the low-energy mapping may give an 
effective system that does not correspond to any familiar classical model, but one can 
still say that the system corresponds to some {d + 1) -dimensional effective model. 

The dynamic critical exponent. In many cases, such as the dimerized Heisenberg 
models mentioned above, the time dimension arising in the mapping tod+l dimensions 
is equivalent (in an asymptotic sense) to the spatial dimensions. The scaling properties 
of such a system at a quantum phase transition are then obtained by just replacing d 
by J -I- 1 in the critical correlation function (53), in hyperscaling relations such as the 
second Eq. (59), and in the scalin forms (98) and (99) of the spin stiffness. In other 
cases, the correlations in the new dimension may be fundamentally different from those 
in the spatial dimensions. The dynamic exponent z relates the power-laws associated 
with spatial and temporal correlations, e.g., if the spatial correlation length t, diverges 
when some parameter g is tuned to its critical value as \g — gc\~^^^ , then the temporal 
correlation length diverges as |g — gcK^> and in classical scaling relations one should 
replace J by J + z for the quantum critical point. 

The dynamic exponent derives its name from the fact that it also governs the dis- 
persion (o{q) ~ of excitations of wave-number q. An important aspect of this is 
that the finite-size excitation gap is obtained by setting ^ oc 1 /L, giving the gap seal- 
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ing Al ~ 1 jU-. This result can be used directly in numerical calculations, and also has 
indirect consequences for the scaling properties of quantities that depend on the ex- 
citation spectrum (e.g., various susceptibilities). Extracting the dynamic exponent is an 
important aspect of computational studies of quantum phase transitions. Apart from this, 
quantum phase transitions, both continuous and first-order ones, can be analyzed with 
the same finite-size scaling methods as the classical transitions discussed above. 

It should be noted that there is a dynamic exponent also in classical systems, but 
this does not come into play in equilibrium statistical mechanics (because if a system 
has a kinetic-energy part of the hamiltonian, the associated phase-space probability 
distribution cancels out in expectation values). The classical dynamic exponent depends 
on how dynamics is introduced into the system [141]. The Metropolis Monte Carlo 
algorithm for the Ising model is an example of classical dynamics (called Glauber 
dynamics), and the long-time decay of the autocorrelations at criticality is governed 
by an associated dynamic exponent. This dynamics does not, however, determine the 
equilibrium properties. In quantum mechanics, the dynamics cannot be separated out, 
but is an integral part of the equilibrium properties, because the full hamiltonian always 
enters and contains in it the static as well as dynamic properties [142]. 

4. EXACT DIAGONALIZATION IMETHODS 

By exactly diagonalizing its hamiltonian, complete knowledge of a quantum spin system 
can be obtained — with the eigenstates available, any static or dynamic quantity can be 
computed. In principle, all eigenstates can be computed exactly for a finite quantum 
system, by constructing the hamiltonian matrix and diagonalizing it numerically. In 
practice, however, such exact diagonalization studies are limited to rather small lattices, 
a few tens of spins, because of the exponential increase of the basis size with the number 
of spins (2^ states in the case of S = 1/2). Great care therefore has to be taken in drawing 
conclusions about the thermodynamic limit, which may not even be possible (if the 
available lattices are too small to accommodate the infinite-size physics). Insights gained 
from exact diagonalization studies are nevertheless very useful, in their own right and as 
a complement to other calculations. Exact results for small lattices are also indispensable 
for testing the correctness of, e.g., quantum Monte Carlo programs. In addition, exact 
diagonalization methods provide a concrete path for learning many important aspects of 
quantum mechanics, in particular the symmetry properties of many-body states. 



Block diagonalization. Given a hamiltonian H, the first step of an exact diagonal- 
ization calculation is to choose a basis in which it and other operators of interest will 
be expressed. The working basis for a spin-1 /2 system normally consists of the single- 
spin states and \,u i = I,. ■■ ,N (with the quantization direction normally taken as z). 
However, because of the 2^ growth of the Hilbert space with the number of spins in the 
system, symmetries should be used whenever possible to first reduce the hamiltonian 
to a block-diagonal form, as illustrated in Fig. 25. In such a scheme, the spin states are 
combined and ordered with the aid of applicable symmetry operations. The blocks corre- 
spond to states with different conserved quantum numbers related the symmetries, e.g.. 
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FIGURE 25. Schematic illustration of block diagonalization. In the original basis, the hamiltonian has 
no apparent structure (left). By constructing states labeled by a conserved quantum number, the matrix 
breaks up into blocks (with all matrix elements zero outside the shaded squares) that can be diagonalized 
independently of each other (middle). Applying another symmetry (conservation law), the blocks can be 
further broken up into smaller blocks (right) labeled by two different quantum numbers, etc. 



crystal momentum conservation following from lattice translational symmetry or the 
conserved z-component of the total spin (the magnetization). The blocks can be diago- 
nalized independently of each other, at a much reduced computational cost. In addition 
to the reduced computational effort, immediate access to the quantum numbers is very 
useful for classifying excitations. 

Some symmetries are relatively easy to take advantage of (e.g., the conservation of the 
magnetization) whereas others require some more work and lead to more complicated 
basis states (e.g., momentum states). Some symmetries that could be used in principle 
are normally not implemented, because the practical complications of the calculation 
may outweigh the benefits. Total spin conservation is an example of this. 

The use of symmetries in exact diagonalization can be discussed using the language of 
group theory [143]. This formalism is not necessary (and often confusing), however, and 
here a more practical approach is taken, with no reference to group theory terminology. 
Group theory is actually very useful when dealing with complex lattices, but the power 
of its formalism can perhaps be better appreciated after a thorough understanding of 
symmetry operations and block-diagonalization has been gained through less formal 
methods for simple lattices. Here we consider ID chains and 2D simple square lattices. 

Outline of this Section. Our discussion will first be framed around the 5=1/2 
Heisenberg chain as a concrete example in Sec. 4.1. We first discuss its symmetries and 
introduce a computer representation of the basis states using bits of integers in 4.1.1. In 
4. 1 .2 we use this representation in pseudocodes to construct the full hamiltonian without 
any symmetries, and then block-diagonalize using magnetization conservation. Momen- 
tum states are discussed in 4.1.3, the use of parity (reflection symmetry) in 4.1.4, and 
spin-inversion symmetry in 4.1.5. Complete diagonalization to obtain finite-temperature 
thermodynamic properties is illustrated with some results in 4.1.6. To compute a small 
number of eigenstates for larger chains, the Lanczos method is developed in 4.2. The 
utility of this technique for several ID systems is illustrated in Sec 4.3. The ground state 
and low-energy excitations of the Heisenberg chain are discussed in 4.3.1. The dimer- 
ization transition taking place in the presence of a frustrating second-nearest-neighbor 
interaction is investigated in 4.3.2, and in 4.3.3 an extended model with long-range in- 
teractions is considered. Diagonalization of 2D systems is briefly described in 4.4. Mo- 
mentum states and other square-lattice symmetries are considered. Results pertaining to 
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the Neel ground state of this system are discussed. 



4.1. Diagonalization of the Heisenberg chain 

We will study the 5 = 1/2 antiferromagnetic Heisenberg chain with hamiltonian 

H = y £'s,- • S,+i = y'^' [5^5^+1 + i(5,+5r^i +575+ i)], (106) 

!=0 1=0 

where, for reasons that will become apparent below, it will here be practical to label the 
spins / = 0, . . . , A^ — 1. Periodic boundaries, Sa^ = So, will be assumed when we consider 
momentum states, but before that the boundary condition is arbitrary. 



4.1.1. Representations of states and symmetries 

Lattice transformations. We use the standard notation |5q, . . . ,5^_j) for the basis 
states, with S] from left to right always corresponding to the spin states on lattice sites 
numbered 0, 1,2, irrespective of the ordering of the subscripts i. Thus, if we write a 
state 1 5j , 5q) , this is different from | 5q , 5j ) unless 5q = 5^ . The former could refer to the 
state obtained from the latter when the two spins are switched by a permutation operator 
P, which can be operationally defined so as to affect the site indices; P|5q,5j) = |5j,5q), 
e.g., P\ tl) = I It). Generalizing this to //-spin lattice transformations, reflections, trans- 
lations, and rotations (in two and three dimensions) are defined in terms of permutations 
of the spin indices. As an example, for a periodic chain we define the translation operator 
as moving the spins one step cyclically to the "right"; 

^l'^0''^l'---''^Ar-l) ~ \^]<i-\->^Q-> • ■ ■ -i^N-ll ■ (107) 

This corresponds to decreasing the spin index by one (modulus the system size N) at 
each location / in the ket: S\ — )■ S\_^. When writing specific states with up and down 
spins denoted by t and J,, e.g., | tit 4- • • •)> indices are normally not needed (and, for, the 
sake of compactness of the notation, the arrows are also not be separated by commas). 

The Heisenberg hamiltonian (106) with periodic boundary conditions is invariant with 
respect to translations, i.e., it commutes with T; [//, T] = 0. We can therefore construct 
momentum states |^(^)), which by definition are eigenstates of the translation operator, 

r|^(yt)) =e'*|^(A:)). (108) 

Here the allowed momenta are k = 2n7i/N, with n = 0, . . . , A^ — 1, following from the fact 
that = 1 . States with different k form their own individually diagonalizable blocks 
of the hamiltonian. How the momentum states are constructed in practice and used in a 
computer program will be discussed in Sec. 4.1.3. 

The Heisenberg hamiltonian also commutes with the reflection (parity) operator, 
which we will define in a way generalizing the two-spin permutations already consid- 
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ered, in terms of the spin index transformation N—l—i; 

^I'^O''^!' • • -^^N-l) ~ I'^AT-l' • • • ''^l''^o)" (109) 

For an eigenstate of P, T\^'{p)) = p\^'{p)), where p = ±1 since = 1. We will use 
T and P for block-diagonalization, although, as we will discuss in detail in Sec. 4.1.4, 
they cannot always be used simultaneously because [T, P] = only in a sub-space of the 
Hilbert space. For a system with open boundaries, T is not defined, but P can be used. 

Spin quantum numbers. Since the hamiltonian is spin-rotationally invariant, its 
eigenstates also have to be eigenstates of the square of the total spin, where 

Af-l 

S=J^S,-. (110) 

i=0 

For an eigenstate we have S'^\^f{S)) = S • 811/^(5)) = S{S+ \)\^f{S)). Here there is 
potential for confusion, as the same symbol S is used for both the spin magnitude of 
the individual spins (i.e., = S) and the total spin of a many -body state. The context 
should always make the meaning clear (and we anyway only consider Si — 1/2 here). 

With the total spin conserved, we know that the states can form blocks labeled by the 
quantum numbers (5, m^), where G {—5, —S + 1, ... ,5} is the total magnetization in 
the direction of the quantization axis, 

N-\ 

m,= Y,S]. (HI) 

If we use momentum states, each ^-block of course also splits into smaller blocks, 
(^, S, m^), because [r, S] = 0. It is easy to block diagonalize using m^, but implementing 
the conservation of total S is normally cumbersome (except for a very small number of 
spins) and therefore rarely used (although 5 = states in the valence-bond basis are some 
times used [144]). We will use m^ conservation in combination with lattice symmetries. 
Note that m^ conservation is more general than conservation of the total S — even if we 
introduce some anisotropy in the hamiltonian by giving a different prefactor to, e.g., the 
Ising (diagonal) term in the hamiltonian (106), is still conserved although total S is 
not. All the techniques discussed in this section can therefore be applied directly also to 
such anisotropic models. 

For the special (and most important) case — (for even AO, we can block- 
diagonalize using a discrete subset of all the possible rotations in spin-space; the spin- 
inversion symmetry, i.e., invariance with respect to flipping all the spins. This is defined 
formally by an operator we call Z; 

■^I'^O''^!' ■ ■ ■ ' ^N-l) ~ I ~ "^0' ""^1' • • • ' ~^N~l)' (^ 

For this operator we again have = 1 and the eigenvalues z = ±1. Since Z commutes 
with both P and T, it can be used together with these operators to further block- 
diagonalize H, which we will do in Sec. 4. 1.5. 
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FIGURE 26. The top line shows an = 8 spin state \s) and its representation as the eight first bits of 
an integer s. Note that we label spins / = 0, . . . — 1 from left to right, while the bits are conventionally 
labeled from right to left, as a binary number. To flip two spins /, j, a bitwise exclusive-or (XOR) operation 
with a mask / (middle line) can be used. Bits / and j of / are set to 1, and all other bits are (i.e., / = 2'+-'). 
The bottom line shows the outcome of the XOR operation [ieor (.?,/) in Fortran 90]. 



The total-spin operator can be written in a form resembling the Heisenberg model 
with equal interactions among all the spins; 

S2=£ £SrS; = 2£SrS, + -A^. (113) 

/=0 j=0 i<j 

Constructing the matrix form of this operator, which we need for computing the quantum 
number S, is therefore very similar to constructing the hamiltonian matrix. 

Bit representation of spin states. S = 1/2 models are special because and f spins 
can be represented directly in the computer by the bit values and 1 of an integer. We 
will take advantage of this here. The bits are conventionally labeled starting from 0, and 
that is why we also number the spins in that way. To refer to the bits / = 0, ... ,31 of 
an integer (or z = 0, . . . , 63 for a "long" integer), we will use the notation s[i\ . A basis 
state I^Q, . . . ,'S'^_i) for N spins is thus represented in the computer by an integer s with 
=5f + l/2for/ = 0,...,A^-l and5[/] =Ofor/>A^-l. 

Most computer languages have functions for examining and manipulating bits. The 
off-diagonal terms of the Heisenberg hamiltonian (106) flip two spins. In pseudocodes 
we will accomplish this operation using a bit function flip(i',/, j), which flips (0 ^ 1) 
bits / and j of the integer s representing the state. How this is implemented in practice 
depends on the language used. One possibility is to use a bitwise exclusive-or operation 
with a mask, as illustrated in Fig. 26. In Fortran 90, this operation can be implemented 
as ieor(5, 2'+^). Later, we will also need functions that accomplish the various symmetry 
transformations of the states; translation, reflection, and spin-inversion. 

With standard 4-byte integers, the bit representation with a single integer works up to 
N = 32. Using long integers, one can extend the scheme up to = 64. The latter is well 
beyond the maximum size for which exact diagonalization techniques can be used in 
practice, except for magnetization sectors with 2m^ = n^ — n^ large enough for the block 
size A'^! / (n^ln^l) to be manageable. Magnetization = and other low-m^ sectors are 
typically of primary interest, however. 

Discussing algorithms for constructing the basis set and the hamiltonian matrix, 
we will start from the simplest method, in which no symmetries at all are employed, 
and then implement the magnetization conservation. In Sees. 4.1.3-4.1.5 we include 
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more symmetries. The actual diagonalization of the hamihonian matrix and the use its 
eigenstates to calculate physical observables will be deferred to Sec. 4.1.6. 



4.1.2. Computer generation of the hamihonian 

If we do not make use of any conservation laws, the hamihonian consists of a single 
2^ X 2^ matrix. We then simply use the numbers a = 0, 1 , . . . , 2^ — 1 to label the basis 
states. The bit-values of these integers correspond directly to the spin states. Determining 
the diagonal contributions {a\SjS\_^_^ \a) = ±1/4 to the hamihonian matrix just involves 
examining the corresponding bit pairs a[i\, a[i + 1] (same or different), whereas an 
off-diagonal operator {SfS^^^ +5^^5^J/2 acting on a state \a) with a[i] ^ a[i+l] 
generates the state \b) where the two bits have been flipped. The matrix element is then 
{b\H\a) = 1 /2. For a state with a[i] = a[i + 1] there is of course no off-diagonal matrix 
element. The following piece of pseudocode generates the full hamihonian matrix H: 



do a = 0,2^-1 
do/ = 0,A^-l 
j = mod{i+l,N) 
if {a[i] =a[j]) then 

H{a, a) = H{a, «) + 5 
else 

H(a, a) = H(a, a) — ^ 
b = ffip{a,iJ);H{a,b) = \ 
endif 
enddo 
enddo 



{2} 



Here j — mod( i -I- 1 , A'^) is the "right" nearest neighbor of i, with the mod function taking 
care of the periodic boundary. In Fortran 90, the test for a[{\ = a[j\ can be implemented 
with the boolean function btest(a,z) to examine bit / of a. Each bond operator in the 
hamihonian corresponds to a single off-diagonal matrix element, while the diagonal 
elements have different contributions. Matrices corresponding to other operators of 
interest can of course be constructed in an analogous way, by examining and flipping 
bits according to whatever combinations of S\ and Sf operators that are involved. 

Using fixed-magnetization blocks. Moving up in sophistication, we next implement 
magnetization conservation. We want to construct the block hamiltonian acting on all 
states with given = {n^ — n_i)/2. There are M = N\/{n'^\n_i\) such states, and we 
need a list of them. The order within this list will be used as a label of the states 
of the block, i.e., we write \a) for the a:th state in the list. We then also need a list 
of integers {sa}, the bits of which represent the spin configuration of the a:th state. 
Later, we will have to search the list Sa to find the position label a of a particular given 
state-integer s, and it is therefore practical to make the list ordered; Sa < Sa+i- We will 
some times use the notation l^^) instead of \a) when referring explicitly to the spins, 
\sa) — |'5'a[0] — 1/2, . . . , 5a[A'^ — 1] — 1/2). The context will make it clear if a label inside 
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I) refers to the position in the list of M states or to the integer containing the spins. 

To construct the state list, we loop over the integers 5 = 0, . . . , 2^ — 1 and check 
whether the number of set bits (the number of spins t in the state) corresponds to 
the target sector; — m^ + N/l. After initializing a state counter a = 0, we can use the 
following pseudocode to generate all the states with given magnetization: 

do 5 = 0,2^-1 

if CLi^ii] = "t) ^^^^ a = a+l;sa = s endif {3} 
enddo; M — a 

We now have a basis of size M stored as the integers Sa,a= 1 , . . . , M. 

To construct the hamiltonian, we loop over the labels a — 1, ...,M and use bit op- 
erations as before to act on the corresponding state-integers Sa. When an off-diagonal 
operation on \sa) leads to another state I**) = \sh), and have to find the position b of the 
integer s* in the state list. Since this list is ordered, we can do this by a bisectional search 
in ~ log2(M) steps on average. Such a search proceeds through a series of bracketings, 
where in each step we can halve the possible range of b by examining the state at the 
mid-point of a range [bmim bmax], with the brackets bmin = 1 and bmax = M initially. The 
following subroutine finds the position b of a state-integer s*; 

subroutine findstate(s*,&) 

^min= 1; Vax=^^ {4} 

do 

b = ^min ~l~ (^max ^niin)/2 

if (s* < Sfj) then 

bmux =b-\ 

elseif (s* > s^) then 

b„nn = b+l 

else 

exit 
endif 
enddo 

Division of an integer by 2 should here be regarded in the standard way, i.e., i/2 for odd 
/ equals (/ — l)/2. The exit from the loop occurs when the basis state si, equals the target 
state s*. One can also use more efficient search procedures, using so-called hash-tables 
[145]. The bisection is much simpler, however, and sufficiently fast in most cases. 

Using subroutine {4}, the part of the hamiltonian originating from operation on a spin 
pair (/, 7) in the state \a) can be constructed with the following modification of code {2}; 

if {sa[i\ ^Sa[j]) then 

H{a,a) = H{a,a) + l {5} 
else 

H{a, a) = H{a, a) — ^ 

s* —Mp{sa,iJ); callfindstate(5*,Z7); H{a,b) — \ 
endif 

If one is just interested in obtaining quick results for some very small lattice, or if the 
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system is not periodic (an open chain or a system with random couplings, in which case 
the momentum is not conserved), it may be sufficient to construct the hamiltonian in 
this form and proceed to diagonalize it (successively for all the m^-blocks desired). For 
serious work on translationally invariant (periodic) systems, it is worth implementing 
additional symmetries to further block-diagonalize the fixed blocks. 



4.1.3. Momentum states 

We now construct eigenstates of the translation operator T defined Eq. (107). Trans- 
lating steps brings the spins back to their original state. Thus, — I, which implies 
eigenvalues e'*^, where the set of non-equivalent momenta can be chosen as, 

2n 

k = m—, m = -N/2+l,...,N/2, (114) 

with the lattice constant equal to 1. A momentum state can be constructed using a 
reference state \a) (a single state in the z-component basis) and all its translations; 

It can easily be verified (by a shift of the summation index allowed due to the peri- 
odic boundaries) that operating with the translation operator (107) on this state gives 
T\a{k)) —d'^\a{k)), which is the definition of a momentum state. 

To construct the momentum basis for given k (and normally also given rrf) we have 
to find a set of representatives resulting in a complete set of normalizable orthogonal 
states. Clearly, for two states \a{k)) and \b{k)) to be orthogonal, the corresponding 
representatives must obey T^\a) ^ \b) for all r. Therefore, among all the states of the 
set of translated states \a{r)) = T''\a), r = 0,...,N—l, only one should be used as a 
representative. With the labels referring to the bit representation, it will be practical to 
always choose the representative as the one for which the integer a{r) is the smallest (as 
determined by carrying out all translations). 

Normalization and excluded representatives. If all the translated states T''\a) are 
distinct, the normalization constant in (115) is just Na = A^. Some reference states have 
periodicities less than A^, however, and this affects the normalization. The periodicity of 
a state is defined as the smallest integer Ra for which 

T^"\a)^\a), Rae{l,...,N}. (116) 

If R < N, then there are multiple copies of the same state in the sum in (115), and the 
normalization constant must be modified accordingly. We could then also restrict the 
summation in (1 15) to r = 0, . . . , — 1, but it is more practical in formal manipulations 
of the states to keep all A^ terms regardless of Ra. 

An important aspect of the momentum basis is that the periodicity of the representa- 
tive has to be compatible with the momentum in order for (1 15) to be a viable state. The 
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compatibility is related to normalizability. The sum of phase factors associated with the 
representative state \a) in the sum in (1 15) is 

F(yt,i?J=''v~'e-^«^« = | ii'/^- if^^« is a multiple of 2;r, 
^ ' 10, otherwise. ^ ^ 

n=0 ^ 

The normalization constant is then 

Na = {a{k)\a{k))=Ra\F{k,Ra)\\ (118) 

and, therefore, if F{k,Ra) = 0, no state with momentum k can be defined using the 
reference state \a). Thus, for given \a) the allowed momenta are those for which kRa is 
a multiple of 271, or 

k=^m, m = 0,l,...,Ra-l. (119) 

Ra 

For the allowed momenta, the normalization constant in (1 15) is 

Na=-^. (120) 

When the reference state is not equal to any non-trivial translation of itself, then Ra = N 
and Na — N. Note again that a given reference state can only appear in a single basis 
state. It is then clear that the momentum states are orthonormal; {b{l</)\a{k)) = datSkkf- 
Also note again that reference states that are not compatible with a given momentum will 
not appear in that block of states. An important aspect of constructing the momentum 
basis is to check the compatibility of a potential representative state with the momentum. 

The hamiltonian matrix. Next, we construct the hamiltonian matrix in the momen- 
tum basis. The periodic Heisenberg hamiltonian under consideration here is translation- 
ally invariant and consists of bond operators S, • S,_|_i. It is convenient to lump all the 
diagonal terms together and consider the off-diagonal terms separately. To simplify the 
formalism to follow, we define operators accordingly; 

N 

Ho=Y^S)S)^„ (121) 
Hj^^{S+_,Sj+S+_,Sj), 7 = 1,...,A^, (122) 

so that H = JYlj=oHj. We now set / = 1. We need to find the state resulting when H 
acts on the momentum state (115). Since [//, T] = we can write 

1 A'-l 1 N N-1 

H\a{k)) = ^ I c-^^-rHla) = ^ £ £ e--^T'-//» , (123) 



and we need to operate with the hamiltonian operators Hj only on the reference state. 
For each operation we get a different state, or, in the diagonal (j — 0) case, the same 
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state. In either case we can write Hj\a) = hj{a)\b'j), where hj{a) is the matrix element 
coming from (121) or (122), and we do not, for simplicity of the notation, include any 
explicit indicator that \b'j) also depends on \a). The prime in \b'j) is there to indicate 
that this new state is not necessarily one of the reference states used to define the basis 
and, therefore, a momentum state should not be written directly based on it. Provided 
that \b'j) is compatible with the momentum, there must be a reference state \bj) which is 
related to it by some number of translations; 

\bj) = T'i\b'j), (124) 

and using this relation we have 

Hj\a) = hj{a)T~^j\bj), Ij e {0,1,. .. ,N - I}. (125) 

Here the notation is again simplified by not making explicit that Ij depends on the actual 
state \bj) (and therefore on |a)). We can now write (123) as 

=1^1 e--*T(-'^)|Z,,-), (126) 

and by shifting indices in the summation, and also noting that \bj) may have a different 
normalization factor (periodicity) than |a), we obtain 



N 



H\a{k)) = £ hj{a)c-^'^u r^\bj{k)). (127) 

j=o V 

We can now simply extract the matrix elements of the hamiltonian operators Hj; 



{bjik)\Hj\aik))=hjia)c-'''^^^. (128) 

Strictly speaking, for the off-diagonal bond operators (j > 0), this is not the only matrix 
element, because individually these are not translationally invariant operators. They 
therefore have matrix elements also between states with different momenta. Here we 
have in mind summing over all j, after which only the elements diagonal in k survive. 

There can be several terms in H that contribute to the same matrix element (128), 
because it is possible (in fact very likely) that Hj\a) oc T^^'\bi) and Hj\a) oc T^^j\bj) 
with \bi) — \bj) (but /(• ^ Ij). Note also that the matrix element hj{a) in (125) is zero 
for an off-diagonal operator acting on two parallel spins. It should also be kept in mind 
that \bj{k)) may not exist even if hj{a) > in in (125), if \bj) is incompatible with 
the momentum. The momentum- state matrix element (128) then does not exist. With 
these caveats, which we will deal with in the implementations below, (128) specifies 
all the non-zero matrix elements of the hamiltonian. For the specific Heisenberg model 
considered here, we can substitute hj{a) by the actual values of the diagonal and off- 
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FIGURE 27. Bit representation of the 8-spin state \s) = \ tiittiit) (top row) along with its transla- 
tions, T''\s), with r = l,...,R, where R~4is the periodicity. The integers s{r) correspond to r successive 
cyclic permutations (to the left) of the bits. The representative is * ( 1 ) = 5 1 ; the lowest integer in the set. 



diagonal matrix elements and obtain 

(129) 



bj)ocT-^JHj\a). (130) 

Matrix elements of other translationally invariant operators (which is what should 
be used in the momentum basis) can be obtained in exactly the same way if the mo- 
mentum transfered by the operator is (i.e., a sum of identical local operators Oi, 
i = 0, . . . ,N — I). For an operator Aq which transfers momentum q ^ (such as S^, 
the Fourier transform of S?) the procedures differ only in that the basis states obtained 
when operating, Aq\a{k)), have momentum /: + ^ (and in that case the basis sets for both 
momenta involved have to be stored). 

Constructing the momentum-state basis in a program. We will now again use a label 
a to refer to the position of the state in the basis, and store the corresponding spins in 
the form of the bits of integers Sa,a= 1, . . . ,M. A momentum state is defined in terms 
of its representative \sa), from which the full state is generated using the translation 
operator T according to (115). In this case we do not know the basis size M a priori, 
because the number of compatible representatives depends on the momentum. If all the 
translations of all states were unique, and there were no states incompatible with the 
momentum, then the number of states would equal [N\/{n^\ni{)\/N. However, since 
many states have periodicities less than N and some are incompatible with k (unless 
k = 0), this is only an approximate basis size. The k = momentum block is always 
the largest, because there are no states incompatible with this momentum. For large 
systems the fractions of disallowed states and periodicities <N are small, and the above 
approximation to the basis size is then quite good. 

To implement the momentum basis in a computer program, we first need to decide 
how the representatives are chosen — in principle the representative of a state I*) could 
be any one of the members of the group of states related to it through translations; 
\s(r)) = T''\s),r = 0, . . . ,N — 1. Fig. 27 shows an example of all the unique translations of 
an 8-spin state in the bit representation. We have to construct a list of the representatives 
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and should be able to easily identify the representative corresponding to an arbitrary spin 
state. As already mentioned, when using the bit representation it is natural to pick as the 
representative the translated state \s{r)) for which the integer s{r) is the smallest. 

To generate the basis, we loop over the integers 5 = 0,.. .,2^ — 1, and, as in code {3}, 
process only those corresponding to a chosen magnetization. The operations needed to 
determine whether a state is a valid new representative are carried out by bit operations. 
When translating I*), if some ^(r > 0) < 5 = ^(O), then the representative of \s) is 
already in the basis and should not be used again [as in Fig. 27, where the original 
integer .y = .v(0) = 153 but the translation 5(1) = 51 < ^(0)]. If all s{r) > s{0), then the 
representative of I*) is not yet in the list. However, I*) may still not be allowed, due to 
its periodicity potentially being incompatible with the momentum, according to (1 19). 

In the pseudocode segments below, the momentum will be represented just by the 
integer e {-N/2 + 1 , . . . ,N/2} multiplying Itt/N in (114), which we here call k. We 
will define a subroutine which checks whether or not a state-integer 5 is a new valid 
representative and, for a valid representative, delivers its periodicity R. To carry out the 
translations of the state \s), we use cyclic permutations of its bits. We assume that this 
is accomplished with a function cyclebits(z,n), which performs a cyclic permutations 
to the "left" (as exemplified in Fig. 27) of the n first bits of the integer /. In Fortran 
90, exactly this operation is available with the intrinsic function ishftc(5, J,n), which 
cyclically right-shifts the first n bits of s hy d steps. The definition of T according to 
(107) corresponds to J = — 1. Our state checking subroutine can be implemented as: 

subroutine checkstate(5,i?) 

R = -l; t = s {6} 
do j= 1,A^ 

t = cyclebits(r,A^) 
if (t < s) then 

return 
elseif {t — s) then 

if (mod(it,A^//) ^ 0) return 
R = i; return 
endif 
enddo 

Here the integer R is first initialized to —1, and this value will be returned (and later 
used as an indicator of a disallowed representative) unless the input state-integer s is 
the smallest among all the translated integers and the periodicity is compatible with 
the momentum (and then R will equal the periodicity upon return). We can now easily 
construct the list of M basis states; 

do 5 = 0,2^-1 

call checkstate(5,i?) {7} 
if i? > then a — a+l; Sa — s; Ra — R endif 
enddo 

M = a 

The list Ra of periodicities will be needed when constructing the hamiltonian. 
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Constructing the hamiltonian matrix. To generate the hamiltonian, we loop over the 
representatives \sa), a=l,...,M. For each of them, we check the bits corresponding to 
all nearest-neighbor pairs j). The diagonal matrix element can be handled exactly as 
in code {5}. For the off-diagonal part, we need a few minor modifications. After two 
spins have been flipped, the resulting state l^') is typically not a representative, and we 
need to find it using Eq. (124). The matrix element (130) also requires the number Ij of 
translations used to bring I*') to its representative l*^). We implement these tasks as a 
subroutine, the contents of which are rather similar to the subroutine checkstate, code 
{6}, that we used when constructing the basis. Given the state-integer .y, we translate 
its bits in all possible ways and store the corresponding smallest integer r (the potential 
representative) found so far, along with the corresponding number of translations, /; 

subroutine representative r, /) 

r = s'; t = s'; 1 = {8} 
do/= 1,A^-1 

t = cyclebits(?,yv) 

if {t < r) then r = t; I = i endif 
enddo 

Having found the representative r, we need to locate its position b in the list {sa}- This 
is done in the same way as before, with the subroutine findstate, in code {4}. Note, 
however, that because of the periodicity constraint imposed by the momentum, it is now 
possible that the potential representative r is actually not present in the list. Therefore 
findstate is slightly modified, so that Z? = — 1 is returned if there is no element r in the 
list, using the following piece of code after the if.. .endif statements in code {4}; 

if (^min > ^max) then 

&=—l; return {9} 
endif 

For each spin flip, we can now add the contribution to the hamiltonian with the following 
code replacing the next-to-last statement in code {5}; 

call representative r, /) { 10} 

call findstate (r,/?) 

if {b>0) H{a,b)=H{a,b) + \{Ra/Rby^^eMi2^kl/N) 
The hamiltonian is complex, except when k = 0orN/2 (actual momentum or 7U). 



4.1.4. Reflection symmetry and semi-momentum states 

We will next consider in addition to the translated states T^\a) those that are generated 
by the reflection (parity) operator P defined in (109). The operators T and P do not 
commute, and so it would at first sight appear that we cannot construct states that are 
eigenstates of both operators simultaneously. These operators do, however, commute in 
the k = Q,% momentum blocks, as we will show explicitly below. In addition, we will 
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construct semi-momentum states that are also parity eigenstates for any k. An advantage 
of such states is that they (and the hamiltonian) are real-valued, in contrast to the standard 
complex momentum states. 

States with parity. Consider the following extension of the momentum state (115); 

\a{k,p)) = 4f +pP)\a). (131) 

where p = ±1. Clearly, this is a state with momentum k [i.e., it satisfies Eq. (108)], but 
is it also an eigenstate of P with parity pi We can check this by explicit operation with 
P, using p2 = xpi = 1^ and the relationship PT = T'^P: 

P\a{k,p)) = ^ I e-'^T-'-(P + p)|a) 

V^^a ,-=0 

= P^''t^'''T\l+pP)\a). (132) 

This is not exactly of the form (131), unless A: = or ;c, for which e'*^'' = e"'^'^ (i.e., 
the momentum k is equivalent to —k, and there is no directionality associated with the 

state). Thus, in these two special cases, parity and translational invariance can be used 
simultaneously for block-diagonalization and \a{k,p)) is indeed a momentum state with 
parity p (or, in other words, [T,P] = in the sub-spaces with momenta k = Q and %). 

Semi-momentum states and parity. Except for the special cases k = 0.,K, parity 
cannot be used to further block diagonalize a momentum block of H. We can, however, 
use parity in combination with the momentum in a different way, by mixing momentum 
states with We consider the sum and difference of these states; 

\a''{k)) = ^''Y.Ct{r)r\c^)^ (133) 
where <7 = ±1 and we have, for convenience, introduced a function C^{r); 

^k^'^'~\^m{kr), a = -\. ^^^^^ 

We we will here refer to k in (133) as the semi-momentum. Note that a is not a 
conserved quantum number, just an indicator for how the momentum states have been 
combined into semi-momentum states. Strictly speaking, the special values A = 0, ;r are 
still conventional (crystal) momenta (and the a = —1 states do not exist for these k) and 
only k in the range < A: < ;r should be referred to as a semi-momenta. We will here 
consider all < A: < ;r on the same footing (where it should be noted that only half of 
the first Brillouin zone is used, as the other half corresponds to the same states). The 
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nonnalization constant is different for the special cases; 



where we have introduced the factor 

In practice, gk will not matter here, because we are only considering matrix elements 
of H, which are diagonal in k. The g-factors therefore cancel out in the ratios of 
normalization constants appearing in the matrix elements. 

Checking overlaps, states with the same k but different a are orthogonal, 

{a-''{k)\a''{k)) = ^ £ sin(A:r)cos(A:r) = 0, (137) 

and other requirements for orthonormality, {a'^ {k')\a'^ {k)) ~ daidkki, of the semi- 
momentum basis can also easily be verified. The advantage of semi-momentum states 
is that they are real- valued for all k, in contrast to the complex momentum states. 

The following equalities — standard trigonometric identities — are useful when manip- 
ulating semi-momentum states; 

C±(-r) = ±C±(r), (138) 
C±(r + J) = C±(r)C+(J)Tq^(r)q-(J), (139) 

where ± stands for a = ±1. Using (139) it is easy to see that the Hamiltonian acting on 
a semi-momentum state mixes a = ±1 states. With Hj\a) — hj{a)T^^i \bj) we get 



H\aHk)) = E/^;(«)^^(q+(/y)I^^W)Tq-(/;)I^TW)), (140) 
from which the hamiltonian matrix elements can be extracted and written as 



{b\k)\Hj\a-{k))^hj{a)T^^~')I^^^Cr (141) 

Incorporating parity. Since the hamiltonian (141) is not diagonal in <7, T, the number 
of states in a semi-momentum block is twice that in a conventional momentum block, 
and thus it would appear that there is not much to be gained over complex momentum 
states by making hamiltonian real in this way (by making the states real). However, we 
can also incorporate parity in a semi-momentum state, by defining 

= ^ I'cf (r)(l+pP)r|a). (142) 
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This is also a semi-momentum state, because the reflected component P\a^{k)) — 
o\Pa'^{k)) is the semi-momentum state obtained by using the reflected representative 
state P\a) instead of It can also easily be verified that (142) is an eigenstate of the 
parity operator for any semi-momentum; P\a^{k^p)) = p\a^{k^p)) with p = ±.\. This is 
simply due to the fact that in (142) the operator (1 + pP) appears before the translation 
operators T'^, in contrast to the momentum state (131) where (1 -|- pP) is written after 
T''. The hamiltonian is thus diagonal in p, and the number of states in each {k, p) block 
is roughly half of that in the original semi-momentum k blocks. We are then back to the 
same block size as with the conventional momentum states, but with states with purely 
real coefficients. 



Orthogonality and normalization of semi-momentum states. Examining the orthog- 
onality of the states (142), we have an apparent problem: When T'^P\a) = \a) for some 
m, the states \a'^{k,p)) and \a~{k,p)) are not orthogonal. We can then useEqs. (138) and 
(139) to write the parity-conserving semi-momentum state (142) as a linear combination 
of non-parity semi-momentum states (133); 



|a±(^,p)) = ^^((l±pC+(m))|a±(^))-pC-(m)|aT(^))). (143) 

It is then clear that {a^^ {k,p)\a^ {k,p)) can be non-zero. To look at this more closely, 
we first determine the normalization constant A'^ in (142). Note that we have attached 
(7 as a superscript to indicate that the normalization constant of the parity-conserving 
semi-momentum states can, unlike the normalization (135) of the plain semi-momentum 
states, depend on a (in addition to the implicit dependence on k, p). In the case T"^P\a) ^ 
I a) for all m, the calculation of A'^ is trivial. When T"^P\a) — \a) for some m we can use 
(143) and the orthonormality of the pure semi-momentum states \a^{k)), resulting in 

A..=^x|'' „ , ^ !"! (144) 

Ra [ 1 + C7pcos(/cm), T'^P\a)^\a) ^ ' 



In the same way, we can calculate the overlap between the <7 = ±1 states when T^P\d) = 
I a) for some m and find 

{a^{k,p)\a^{k,p)) = -p, {T P\a) = \a) for some m). (145) 

This is of course under the assumption that both the C7 = ±1 states exist (i.e., ^ 0). 
Thus, the o — ±1 states in the case T'^P\a) — \a) differ at most by a sign, and we 
should include only one of them in the block of states. For definiteness, we can choose 
\a'^{k,p)) if 7^ and \a^{k,p)) else. We still have to pay attention to the non-zero 
overlap (145) in some formal manipulations with the semi-momentum states, as we shall 
see shortly. 

The semi-momentum hamiltonian. To calculate the hamiltonian matrix elements, 
we first note that, instead of (124) for a plain momentum state, now when we act 
with a hamiltonian operator Hj on \a) we get a state \b'j) which is related to another 
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representative state \bj) by a number of translations and possibly a reflection: 



\b^) = Thpii\y.)^ /.e{o,...,A^-i}, ^,-€{0,1}. 



(146) 



The result of H acting on a semi-momentum state (or a = 0, ;r momentum state with 
parity) can therefore be written in in the form 



j=0 r=0 

where the representative \bj) is related to Hj\a) 

Hj\a) = hj{a)P^iT-^^\bj), e {0, . . . ,Ar- 1}, ?, e{0,l}. 
Using the relation (139) we can write (147) as 



(147) 



(148) 



N 



H\a<'{k,p)) = l^hj{a){Gpr^\l^ 



X 



(149) 



cos{klj)\bj{k,p))-o, 



L-sm{klj)\b-^{k,p))]. 



The ratio of the o — ±1 normalization constants is 1 if T"^P\bj) ^ \bj) for all m, and 
otherwise it can be written as 



1 — Gpcos{km) 



sm{km) 



(150) 



^ ^bj Y 1 + crpcos(^m) l + opcos{km) 
Using the overlap (145) we can extract the matrix elements. The ones diagonal in a are, 



{bJ{k,p)\Hj\a^{k,p)) = hj{a){cpY^\j ^ 



X 



cos{klj), P\bj)j^T'-\bj), 

cos{klj)+apcos{k[lj~m]) p\h ■) = T"^\h ■) 



(151) 



whereas the off-diagonal ones are 



{bJ^{k,p)\Hj\a<'{k,p)) = hj{a){apy^^ ^ x 

-osm{klj), P\bj) ^T"'\bj), 

- a sinjklj ) +p sin {k[lj -m\)\ pl^A^^ml^A 



(152) 
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FIGURE 28. A state |s) = 1 114-114-4-4-) (upper left) along with all its transformations by the translation, 
T, and reflection, P, operator in the bit representation. The corresponding base-10 integer s and its 
transformations are also shown-the smallest integer, s = 27, corresponds to the representative state. For 
this particular state, T^P\s) = \s), i.e., m = 5 in the normalization constant (144). 



Degeneracies. In the conventional momentum basis, states with ±k are degenerate, 
but are orthogonal states (except when k^O,K).ln the semi-momentum basis, ±k states 
are really the same state, differing at most (in the case a = —1) by a sign (and that is 
why the semi-momentum Hilbert space only includes < A: < ;r). The degeneracy has 
here been moved to the parity sector, with the p±l states being degenerate (but still 
orthogonal) for k^O.n. This can be seen by letting (;?, c, t) — )■ — (p, c, t) in the matrix 
elements (152), which leaves unchanged the set of matrix elements for t, a = ±1. Thus, 
in calculations we need to consider only, e.g., p = +lfork^O,n (while for k = 0,nwQ 
only have <7 = T = 1 and the p = ±1 sectors are not degenerate). 

Constructing the basis of semi-momentum states. Generalizing the convention we 
used for momentum states, we now choose as the representative of a state \s) the state 
\s{rjq)) = T''P^\s) for which the corresponding integer ^(r, ^) is the smallest (where 
r G {0, . . . — 1}, G {0, 1}). States can now be disallowed in a given block not only 
due to incompatibility with the momentum, but also because of restrictions imposed by 
the reflection quantum number p and the non-conserved state label o. We thus have 
to determine which one, or both, of the (7 — ±1 semi-momentum states (142) should 
be included, according to the conditions discussed above. Fig. 28 illustrates all the 
combination of translations and reflections in the bit representation of a state which 
is related to its own reflection according to js) = T^P\s), in which case only the (7 = —1 
or (7 = -1-1 variant of the state should be included in the basis (and the state is compatible 
with all momenta, because its periodicity R = N). 

When both a = ± 1 states are required, we will store two consecutive copies of the 
same representative, so that we can continue to use the location in the list Sa as the 
label of each state within the block. We then also need to store the o labels for each 
representative, as well as the number of translations m by which the reflection of a state 
is brought back onto itself, which is needed in the normalization and the hamiltonian. 

As before, we will use a subroutine checkstate to determine whether or not a state 
is a new representative to be added to the list. In addition to the periodicity R of a new 
representative, this subroutine now also delivers the reflection-translation number m, 
when applicable. We can use m— —I as a flag indicating that there is no m for which 
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T'^P\s) = \s) (and otherwise m > 0). The operations on the reflected state are carried out 
after the instructions in {6} have been performed, by first reflecting the spin bits in the 
state integer s and storing it as r , ? [/] = 5 [A^ — 1 — /] for / = 0, . . . , — 1 . This reflection is 
accomplished by a function reflectbits(5, A'^). Then the resulting integer is translated as 
before. We thus modify code {6} according to: 

subroutine checkstate(5,i?,m) 

{11} 

r = reflectbits(5',A^); m — —l 
do/ = 0,i?-l 
if {t < s) then 

R = —1; return 
elseif {t — s) then 
m = i; return 
endif 

t = cyclebits(r,A^) 
enddo 

Here ... represents all operations in code {6}. Note that the loop in the code above starts 
at i — 0, and the translations are carried out after comparing t with s, since a state can 
be its own reflection (in which case m = 0). Also, since we have already determined 
the periodicity R of the state, we need to go only up to / = — 1 in this loop. The 
function reflectbits has to be implemented by hand; a corresponding internal function is 
normally not available. In Fortran 90 one can use the functions btest(j,Z7), setbit(j,Z7), 
and clrbit(/, b), to examine, set, and clear individual bits & of an integer /. 

A state which has passed the above checks is still not necessarily a valid representative 
for both (7 = ±1, and the state must therefore be further examined to determine which 
(one or both) of the a = ± 1 copies of the representative should be added to the basis. 
The following code segment stores the state information for <7 = ±1 states that are to be 
included in the block based on the criteria discussed above; 

call checkstate(5, R, m) 

doc7 = ±l (doonly (7 = +l ifA: = 0orA: = A/'/2) {12} 
if (m 7^—1) then 

if (1 + opcos{ikm2K/N) = 0) 7? = -1 

if {a = -1 and 1 - apcos{ikm27t/N) ^0)R=-l 
endif 

if i? > then a = a+l; Sa = s; Ra = oR; ma=m endif 
enddo 

Recall that here m = — 1 means that there is no m such that T'"P\s) = \s), whence both 
C7 = ± states should be included. On the other hand, if there is such an m, then only 
one of the states is included — we pick a = 1 if the normalization constant (144) for 
it is non-zero. This is accomplished above by the two consecutive if statements (where 
i? = — 1 is set for an invalid representative). We have to store o, and since the periodicity 
R is positive and non-zero, we can store it and o jointly as oR in a list Ra- A more dense 
packing of R, a,m into a single integer is of course also possible. 
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Constructing the semi-momentum hamiltonian. For general semi-momentum, the 
same representative can appear once or twice in the list Sa, whereas for the special cases 
^ = 0, ;r there is always only a single (a = 1) copy of each representative. We here 
construct code that can treat both cases, but simpler code can be written for the two 
special momenta. When building the hamiltonian by looping over the state indices a, 
with the corresponding representatives Sa, we first check the previous and next items in 
the list. If the previous item Sa-i = Sa, then we skip over this representative, because it 
will already have been taken care of in the previous pass through the loop. If the next 
representative, Sa+i equals Sa, then we store as « = 2 the number of same representatives, 
and else set n = 1. This part of the hamiltonian building is accomplished by; 

do a = 1,M 

if (a > 1 and Sa = Sa-i) then {13} 

skip to next a 
elseif (a<M and Sa = Sa+i) then 

n = 2 
else 

n=l 
endif 

enddo 

Here the skip command skips to the next iteration of the loop and ... stands for the bulk 
of the loop, the main features of which we discuss next. 

Considering first the diagonal matrix elements H{a, a), for a given a we now first sum 
up all the diagonal contributions from the bit configurations in Sa and store the result as 
E^. Taking into account the possibility of one or two entries of the representative (n= 1 
or 2), we can use the following loop to assign the diagonal matrix elements accordingly; 

do i = a, a + n— I 

H{a,a)^H{a,a)+Ez {14} 
enddo 

Turning next to the off-diagonal operations, looping over site pairs (ij), if the bits 
Sa[i\ 7^ Sa[j], then the spins are flipped, resulting in a state-integer named s' as in code 
{10}. We again have to identify the corresponding representative state |r) = T^P'^\s') 
by carrying out all the symmetry operations to determine the transformation indices / 
and q appearing in the hamiltonian matrix elements (151) and (152). The subroutine 
accomplishing this is a simple modification of code {8}; 

subroutine representative ( 5' , r,l,q) 

{15} 

? = reflectbits(/,A^); ^ = 
do j= 1,^-1 
t = cyclebits(?,iV) 

if {t < r) then r = t; l = i; q=l endif 
enddo 
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Here ... stands for the code after the header in {8}. After having called this subroutine, 
we can search for the representative r in the list {sa}- This is done in the same way as 
we did before, with the subroutine findstate(r, b). 

Since the hamiltonian is more complicated than in the standard momentum basis, we 
also define a function helement((2, b, /, q), which returns the matrix element according to 
(151) or (152). The function takes as input the labels a and b (locations in the matrix) of 
the two representatives and the transformation indices /, q delivered by representative. 
The matrix element also of course depends on the momentum k and the parity quantum 
number p = ±1, which we do not indicate explicitly here. The function is straight- 
forward to implement based on (151) and (152), and we do not list any code here. 

As with the state-integer Sa in code segment {13}, we also here need to take into 
account that the representative si, can appear once or twice (indicated below with m—\ 
or 2) in the list of representatives. In case there are two copies, we do not know whether 
the subroutine findstate has returned its first or second location. Thus, we again need 
to examine positions in the list of representatives adjacent to the one delivered. The 
following code segment does all that and then assigns the matrix elements: 

5' = flip(5fl,/,7) 

call representative ,r,l,q) {16} 
call findstate (r,/?) 
if {b > 0) then 

if (& > 1 and Sb = S},-i) then 

m = 2;b = b — l 
elseif (b <M and = s^+i) then 

m — 2 
else 

m = 1 
endif 

do j — b,b + m— I 
do / = a,a + n — 1 

H{i, j) = H{i, j) + helement(/, j, I, q) 
enddo 
enddo 
endif 

This piece of codes replaces the much simpler four- line code segment {10} for the pure 
momentum basis. The advantage is that the matrix H is now real-valued. 



4.1.5. Spin-inversion symmetry 

Spin-inversion symmetry, with the operator Z defined in (1 12), can be used to reduce 

the hamiltonian block size in the magnetization sector = 0. The fact that the magne- 
tization is conserved implies, however, that there is nothing to be gained by using this 
symmetry in the |m^| > sectors (since such a basis would consist of mixed ilm^l states, 
and the blocks of symmetric and anti-symmetric combinations would be of the same size 



Computational Studies of Quantum Spin Systems 



January 18, 2011 



75 



as the original blocks). In models that do not conserve m^, e.g., the transverse-field Ising 
model, spin inversion can be exploited for all states. We denote by z the eigenvalue of Z. 
Since = 1, we again have z = ±1. 

Unlike the parity operation considered in the previous section, the spin-inversion 
operator Z in (112) commutes with the translation operator, [r,Z] = 0. The associated 
quantum number z is therefore conserved together with k in all momentum sectors. In 
the magnetization sector = 0, we can therefore always split a momentum block into 
two smaller ones by using states of the form 



\a{k,z)) 



1 



N-l 



£e-'^'-r(l+zZ)|a), 



(153) 



a r=0 



where the normalization constant is easily obtained as 



" Ra ^ 1 l+zcos(A:m), 



r'^Zla) ^ \a) Vm, 
T'^Zla) = \a). 



For a hamiltonian operation on | a) resulting in 

Hj\a) = hj{a)ZSJT-^J\bj), Ij e {0,...,N -I}, gye{0,l}, 
we obtain the matrix element 



{bj{k,z)\Hj\a{k,z)) = hj{a)z^iQ-'^^u 



11% 



(154) 



(155) 



(156) 



which is valid for any k. As always, for 7 = this reduces to just ho{a). 

Spin-inversion symmetry with semi-momentum states. We can also consider semi- 
momentum states incorporating spin-inversion symmetry; 



1 



A'-l 



a<'{k,p,z)) = -y=l^C^{r){l+pP){l+zZ)r\a), 



(157) 



These states are eigenstates of Z as well as P. 

When calculating the normalization and constructing the hamiltonian we now have 
to consider five different types of reference states, depending on which combinations of 
symmetry operations transform the reference state into itself; 



(158) 



Here the inequalities should hold for all m and the equalities for some m,n. The calcu- 
lations follow the procedures of the previous section, and we just list the results. The 
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ym^ 
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jmp2 


a)^ 


a), 
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\a), 


ym^ 
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jmp2 
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nonnalization constants are: 



2N^ 

Ragk 



1, 1) 

1 +opcos{km), 2) 

X < 1 +zcos(^m), 3) 

l + opzcos{km), 4) 

^[l + opcos{km)][l+zcos{kn)]. 5) 



(159) 



For cases 2), 4), and 5), only one state out of a pair with c = ±1 should be included in 
the basis as the two states within such a pair differ merely by a factor. To be definite, we 
can again pick CJ = 1 if that makes A^^ > and o = — I else. Acting with a hamiltonian 
operator on a representative now leads to a new representative \bj) according to 



Hj\a) = hj{a)P^^ZS^T-h\bj). 
The a-diagonal matrix elements of H are 



{bJ{k,p)\Hj\a''{k,p)) = hj{a){apy^z^^ 
cos{klj), 

cos{klj)+<7p cos{k[lj—m 

l+(jpcos(km) ' 
cos{klj)+(jpzcos{k[lj—m]) 
l+apzcos(km) 

whereas the ones off-diagonal in a are, 



{bj''{k,p)\Hj\a^{k,p)) = hjia){Gpy^z^^ 
-osm{klj), 

-asm{klj)+psm(k[lj—m]) 



(160) 




(161) 



l — (jpcos{km) 
— (T sin (fc/y ) + pz sin {k[lj - 



■m]) 



I — (7 pz cos (km) 




(162) 



These expressions may seem rather complicated, but it should be noted that they are 
completely general for ID systems, not just for the Heisenberg chain considered here. 
Once they have been implemented and tested for some model, the code can easily be 
reused for other systems. 



Program implementation; combining all symmetries. Incorporating spin-inversion 
symmetry in the semi-momentum basis with m^. = 0, with states \a^{k.,p,z)) defined in 
(157), we have to augment the subroutine checkstate in code segment {11}. It should 
return the translation numbers corresponding not only to reflection P of a representative, 
T'^pP\s) — \s), but also translation numbers in symmetry relationships involving Z and 
PZ; T^^Z\s) = \s) and T"^p^PZ\s) = \s). To accomplish the spin inversion in the bit 
representation, we define a function invertbits(5, A^) which flips all the bits .v[/] — s- 
1 — s[i], i = 0,N — I. This can be easily accomplished without individual bit- level 
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TABLE 1. Size of the k = state blocks for magne- 
tization = and different parity and spin-inversion 
quantum numbers {p,z)- 



N 


(+1, + 1) 


(+1,-1) 


(-1, + 1) 


(-1,-1) 


8 


7 


1 





2 


12 


35 


15 


9 


21 


16 


257 


183 


158 


212 


20 


2518 


2234 


2136 


2364 


24 


28968 


27854 


27482 


28416 


28 


361270 


356876 


355458 


359256 


32 


4707969 


4690551 


4685150 


4700500 



operations as s = — s (for A'^ less than the number of bits in the integers used; 
otherwise a different formula has to be used). Codes {6} and {11} can be easily modified 
to also check the inverted and reflected- inverted states to determine and m^p. 

If the modified checkstate returns a potential representative, — I, then again the 
state has to be examined further to determine whether it satisfies the further criteria for a 
representative for one or two states cr = ±1 . An allowed representative now falls into one 
of five different classes depending on its symmetry properties as summarized in (158). 
The class, c e {1,...,5} can be determined from the translation integers mp^m^^m^p 
delivered by checkstate. After c has been determined, the three translation integers can 
be reduced to two; the m and n in the normalization constants (159). The class c can be 
packed along with m,n into a single integer, e.g., as m + n(A^+ \)-\-c{N -\- 1)^, which is 
stored along with the periodicity and a index packed as oR as before. 

For the construction of the hamiltonian, the subroutine representative, the main parts 
of which are described in code segments {8} and {15}, has to be amended further in 
order to also return an index g corresponding to the number, or 1, of spin-inversions 
needed, along with q reflections and / translations, to transform the state 1 5) generated by 
a spin flip into the corresponding representative; \r) = T^jP'iiZ^i\s) The construction 
of the hamiltonian then proceeds exactly as before, with a modification only of the 
function helement to include also g as an argument in code segment (5.15). This is 
straight-forward, and there is no need to list any code here. 

Examples of state block sizes. This is as far as we will go with applying symme- 
tries for ID systems. For the Heisenberg model, the total spin is also conserved, but 
incorporating that symmetry in the basis is much more complicated. In principle it can 
be done for total spin singlets, using the valence-bond basis [144], but it is rarely done 
in practice because the resulting hamiltonian is very dense, unlike the sparse matrices 
obtaining with the symmetries implemented here (the number of non-zero elements is 
proportional to M x for an M x M matrix, and <^ N). The sparseness will be very 
useful in the Lanczos calculations discussed in Sec. 4.2. 

Table 1 shows the size of the Hilbert space blocks for several different chain sizes 
N when all the symmetries are used. Here the momentum k = Q and the number of 
states is shown for all combinations of the parity and spin-inversion quantum numbers 
(p = ±l,z = ±l). The largest block is always the fully symmetric one. As the system 
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size grows, the relative variations in the block size diminish, and for large A'' all sizes are 
rather well approximated hy N\/[4{j ! )^A^] . For k — n, the blocks are approximately of 
the same size as in the table for A: = 0, and for other momenta, where both (7 = 1 and — 1 
states are allowed, the blocks are roughly twice as large. 



4.1.6. Expectation values and thermodynamics 

We are now ready to diagonalize the hamiltonian and calculate physical observables. 
Here we will first consider complete diagonalization, meaning that we compute all the 
eigenvalues and eigenstates of H. In principle, the eigenvalues A„, n = 1,...,M, of a 
non-singular M xM matrix H can be obtained by solving the secular equation, 

det[//-V]=0, (163) 

where det[] denotes the determinant and / is the unit matrix. The eigenvectors v„ can 
subsequently be obtained by solving the linear system of equations 

HVn = EnVn. (164) 

However, since the secular equation is very complicated for a large matrix, this method 
is not used in practice. Most numerical matrix diagonalization methods are based on an 
iterative search for a unitary transformation U such that 

U~^HU = E, (165) 

where E is the diagonal eigenvalue matrix with = E^ and, for a complex hermitian 
matrix H, the inverse of the unitary matrix U is the transpose of its complex 
conjugate matrix; = U*^. If the matrix is real and symmetric, we have = U^. 

The columns of the diagonalizing matrix U contain the eigenvectors of H. This can be 
seen by multiplying (165) with U from the left, giving HU = UH. Since E is diagonal, 
the nth column of U on the right side of this equality is multiplied by the nth eigenvalue 
(i.e., the matrix element = Xn). In the multiplication by H on the left side, the nth 
column of U gives the nth column {HU)n of HU, i.e., HUn = EnnUn, and thus the 
eigenvectors v„ of H are identified as Vn = Un. 

The expectation value of some observable (operator) A in an eigenstate of H is given 
by the diagonal element of the corresponding matrix transformed to the energy basis; 

(n|A|n) = [U-^AU]nn. (166) 

One would typically be interested in the ground state, n = 1 (assuming the eigenvalues 
to be ordered from the lowest to the highest), of the block in question (especially the 
actual ground state, in the block with the lowest energy of all). For a thermal average, 
all eigenvalues and/or eigenstates of all symmetry blocks are needed; 

1 Mj Mj 

{A) = ^LL^-^'HUi%UjU, Z = H^.-P^J^. (167) 

j n=l j n=l 



Computational Studies of Quantum Spin Systems 



January 18, 2011 



79 



TABLE 2. The eight lowest energies for a 16-site chain with momentum A; = 
in the blocks with parity and spin-inversion quantum numbers (p = ±l,z = ±l). 



n 




S 




S 




S 


£(-1,-1) 5 


1 


-7.1422964 





-4.9014133 




-4.1926153 


2 


-5.7475957 1 


2 


-6.1223153 


2 


-4.1067293 




-3.6537528 





-4.9014133 1 


3 


-5.5912905 





-3.9398439 




-3.6085498 


2 


-4.6986358 1 


4 


-5.0981578 


2 


-3.7756347 




-3.2192241 


2 


-4.1067293 1 


5 


-4.8142442 





-3.6808576 




-3.2129324 





-4.0007340 1 


6 


-4.5657878 





-3.5785191 


3 


-3.1695648 


2 


-3.9398439 1 


7 


-4.3243602 


2 


-3.3678831 


1 


-3.1652647 





-3.7756347 1 


8 


-4.1926153 


2 


-3.3605397 


3 


-3.1169772 


2 


-3.6808576 1 



Here /3 = T is the inverse temperature (in units where ks = 1), and the index j 
collectively denotes the different quantum numbers, m^, k, /?, z, of the blocks of size Mj. 

Matrix diagonalization is a standard linear algebra operation, and sophisticated sub- 
routines are available in many software libraries. We will therefore not discuss the inner 
workings of matrix diagonalization procedures here. Almost always, a diagonalization 
routine delivers the eigenvalues in ascending order in a vector, along with the matrix U 
with the corresponding eigenvectors. 

The number of operations needed to diagonalize an M x M matrix generally scales 
as M^, and the memory required for storage is ~ (even for a sparse matrix, as 
intermediate steps normally do not maintain sparsity). This severely limits the size of the 
matrices that can be fully diagonalized in practice. Currently M 10^ can be handled 
without too much effort on a workstation, and a few times larger on a supercomputer. 
Looking at Table 1, it is then clear that one cannot realistically carry out complete 
diagonalizations for Heisenberg chains larger than ^ 20. Calculations aiming at just 
the ground state, and possibly some number of excited states, can be carried out in other 
ways for larger systems, using, e.g., the Lanczos method discussed in Sec. 4.2. 

Total spin. It is useful to know the total spin S of the energy eigenstates. Since we do 
not use the conservation of S when block-diagonalizing the hamiltonian, we have to cal- 
culate using the states. As we have already noted, this operator is formally equivalent 
to a Heisenberg hamiltonian with long-range interactions, as written explicitly in (113). 
We can use a slightly modified version of the procedures we developed for constructing 
the hamiltonian to obtain the matrix for S^. We then transform it with U, after which we 
have 5(5+ 1) in the form of the diagonal matrix elements as in (166). 

Let us look at some results obtained for a 16-site chain in the k = momentum sector. 
The eight lowest energies are listed for each of the symmetry sectors (p,z) in table 2, 
along with the calculated spin of the states. The lowest-energy state is a singlet with 
{p— \,z— 1). The first excited state of the system is actually not in the A: = sector; it 
is a A: = ;r triplet with (p = -l,z = -1), at energy E = -6.87210668. 

One interesting feature to note in the table is that, in every symmetry sector, all the 
states have either even or odd spin. This is due to the spin-inversion symmetry. An = 
state with N sites and total spin 5 can be written as a linear combination of states made 
up ofN/2 - 5 singlets, (| Uj) - I Mj))/V2, and 5 triplets, (| Uj) + 1 M;))/V2 [48]. 



Computational Studies of Quantum Spin Systems 



January 18, 2011 



80 



Since the singlet is odd under spin inversion but the triplet is even, it follows that a spin 
S state of an N-site system has spin-inversion quantum number z = +lifA^/2 and S are 
both even or both odd, and has z = — 1 else. Thus, even if we do not calculate S we have 
some limited knowledge of it also from z. The low-energy states typically also have low 
spin, 5 = or 1, and these can then be distinguished by z alone. Note, however, that the 
lowest state in a given (p,z) sector does not necessarily have the lowest (even or odd) 
spin, as exemplified in Table 2 by the lowest (/? = — 1 , z = 1 ) level, which has S = 2, not 
5 = 0. However, this is not a low-energy state, as there are 1 1 states with lower energy in 
the k = sector (and numerous additional ones in other momentum sectors). The k = 
state with second-lowest energy also has S — 2, but there are four 5=1 states below it 
in other momentum sectors. 

Magnetic susceptibility and specific heat. As an example of thermodynamics, let us 
calculate two important properties of the Heisenberg chain; the specific heat C and the 
magnetic susceptibility x^- These can be evaluated according to the formulas, 

(168) 

K)2), (169) 

which can both be easily derived using (167). In the definition of x, h is m external 
magnetic field in the z-direction, which couples through a term —hm^ added to the 
hamiltonian. We will here consider the zero-field case only, and thus (m^) = 0. 

Both C and x are special quantities in the sense that they do not depend on the states, 
just the energy spectrum. For the Heisenberg model one can calculate these quantities 
by just considering the m^. = sector, because of the conservation of the total spin 5. A 
spin 5 state is (25+ l)-fold degenerate with magnetization m, = —5, . . .5. In the case 
of C, we therefore just have to sum 168 over the = levels and weight them by 
this degeneracy factor to obtain the full average over all m^. In the case of Xz^ we can 
proceed in the same way, using also the fact that, in a spin-5 multiplet, the average 
(mj) = (S2)/3 = 5(5+ l)/3 in (169). 

The situation is slightly complicated by the fact that there can be "accidental" de- 
generacies not related to any apparent symmetry of the hamiltonian. If such degenerate 
states have different 5, then the diagonalization procedure will give some random (de- 
pending on exactly how the diagonalization is done) mixed spins for the states. One 
should then in principle diagonalize in the degenerate subspace to obtain the spin 
eigenvalues. However, in practice these accidental degeneracies are very rare. For small 
Heisenberg chains they occur only in the ^ = ;r,/7 = l,z = 1 sector. For = 10, there 
are two such states with £' = 0.5, forA'^=12 and 14 there are no accidental mixed-5 
degeneracies, and for = 16 there are four mixed levels at energy 2J. With such a small 
number of degeneracies, and at such high energies, we can just ignore them. 

Fig. 29 illustrates how the results for C and x converge to the thermodynamic limit 
with increasing system size. Going up to A'^ = 16, the results are well converged down 
to T /J ^ 0.25. It is natural that the convergence is more rapid at high temperatures, 
because in the limit T ^ oo the spins are statistically decoupled and the system behaves 



C 
X 



d{H) 
dt 

dh 



4 «">;)- 
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as a set of independent spins. Thus the susceptibility per spin for T — > oo has the single- 
spin Curie form; x ~^ 1/47. For C, an analytic high-temperature form can be obtained 
by calculating the energy exactly for the 2-site system (a single spin has constant energy, 
thus giving C = 0), taking the derivative, and keeping only the leading term of an 
expansion in . This gives C — 3/13r^ when T — )• 0°. The high-T limits are also 
shown in Fig. 29. In the case of C, this form describes the behavior well down to 
T /J ^ 1.5, whereas in the case of % the agreement is good only at higher temperatures. 

When r — 7- 0, both C and x approach zero exponentially for a finite system, as a 
consequence of the finite gap between the ground state and excited states (due to which 
only the ground state contributes to thermal averages as T — )• 0) and, in the case of the 
fact that the ground state of the Heisenberg chain for even is always a singlet (while 
for odd the ground state is an 5 = 1/2 doublet and x thus diverges as T^^). In the 
infinite system, C — t- but x approaches a constant non-zero value. 

Note that here, as well as in many cases we will encounter later on, when we talk 
about r = properties it is important to consider the order in which the limits T — )• 
and A'^ — 7- 00 are taken. The susceptibility of a chain with finite (even) always vanishes 
as r — 7- 0. The temperature at which the exponential drop to zero commences depends 
on the finite-size spin gap, which in the case of the Heisenberg model is oc A'^^i (which 
we will also discuss further and illustrate with data in Sec. 4.3. 1). Thus, for given A^, one 
can expect essentially thermodynamic-limit results down to some T/JocN-^ (which 
can also be roughly seen in Fig. 29), and if one takes the limit N 00 before T 0, 
then the susceptibility remains non-zero at T = 0. 

It is not possible to use exact diagonalization results to extrapolate reliably the prop- 
erties of the Heisenberg chain to the thermodynamic limit at low temperatures. For the 
susceptibility, an extrapolation of exact diagonalization results [146], using a particular 
functional form [147], was already attempted decades ago — the result is known as the 
Bonner-Fisher susceptibility. This extrapolation used also the known exact value of x at 
T ^0 {N = 00), which was available from the exact Bethe ansatz solution of the ground 
state. The Bonner-Fisher curve has been commonly used in analyzing experimental re- 
sults for quasi-one-dimensional antiferromagnets (weakly coupled chains). The most 
important feature here is the location and shape of the maximum at r/7 ^ 0.6, which 
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is already well converged for small systems, as seen in Fig. 29. It has for some time 
now been possible to use other computational methods to study various properties of 
the Heisenberg model on very long chains (as we will discuss in Sec. 5.3.1), and thus 
the low-temperature limit can be accessed more reliably. There are also essentially exact 
results from field-theoretical studies combined with the Bethe ansatz [7]. These cal- 
culations show an anomalous feature; the approach to the T ^ limit is logarithmic, 
and therefore very difficult to reproduce with finite-A^ calculations. The low-r form 
of the Bonner-Fisher curve is therefore incorrect. Experimentally the most important 
feature is the broad maximum. Often effects beyond the Heisenberg chain appears at 
lower temperatures (e.g., due to inter-chain couplings, spin anisotropics, or disorder). 
The spin susceptibilities of some quasi-one-dimensional materials actually follows % of 
the Heisenberg chain down to temperatures as low as 7/50, where the logarithmic 
behavior is prominent [8]. 



4.2. The Lanczos method 

As we have seen in the preceding section, a complete diagonalization of the hamilto- 
nian (or individual blocks) becomes prohibitively time consuming for 5 = 1/2 systems 
with more than Ri 20 spins. For higher 5, the situation is of course even worse. If we re- 
strict ourselves just to the ground state, and possibly a number of low-lying excitations, 
we can reach systems roughly twice as large, by using a Krylov-space technique, such 
as the Lanczos method. 



4.2.1. The Krylov space 



The Krylov space is a sub-space of the full Hilbert space, constructed in such a way 
that the low-lying states of a hamiltonian H of interest should be well approximated 
within it. Consider an arbitrary state |^), e.g., a one with randomly generated vector 
elements ^(i), / = 1, . . . ,M, in the M-dimensional Hilbert space in which H is defined, 
and the expansion of this state in terms the hamiltonian eigenstates (in order of 
increasing eigenvalues, which we here label n = 0,l,...,M— 1). We operate with a 
power of the hamiltonian on this chosen state; 



M-l 
n=0 



maxi^inax 



+ I 



A 



Wn) 



(170) 



If the power A is large, the state corresponding to the eigenvalue l^maxl with largest 
magnitude (i.e., max = or max = M — 1) will dominate the sum, provided that the 
expansion coefficient Cmax 7^ 0. Hence, acting many times with the hamiltonian on the 
state will project out the eigenvector with the eigenvalue ^max- If we want to make sure 
that the ground state 1 1^0) is obtained (exactly for A ^ 00 or approximatively for finite 
A), we can instead of use {H — c)^, where c is a positive number large enough to 
ensure that |£'o — c| > |£'m 1 — c\- Here we will in the following assume that such a 
constant, if required, has already been absorbed into H. 
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While Eq. (170) is guaranteed to produce the ground state when A is sufficiently large, 
a more efficient way to construct a state which approaches the ground state as A ^ oo 
is to consider not only H^\^), but the whole subspace of the Hilbert space spanned 
by the set of states H"^]^), m = 0, . . . , A. These states can be constructed one-by-one 
by successive operations with H on the initial state |^). In this subspace, an optimal 
linear combination of vectors approximating the ground state (minimizing the energy 
expectation value) exists, and the way to find it is to diagonalize H in the generated sub- 
space of A-|- 1 vectors. In addition to projecting out the ground state for relatively small 
A (often many orders of magnitude smaller than the size M of the full Hilbert space), 
this approach can also accurately reproduce a number of low-lying excited states. 

The subspace of the Hilbert space obtained by acting multiple times with on an 
initial state is called the Krylov space. We are here discussing hamiltonians in quantum 
mechanics, but Krylov space methods of course apply to eigenvalue problems more 
generally as well, and are very widely used in many areas of science and engineering. 



4.2.2. The Lanczos basis and hamiltonian 

In the Lanczos method [148], an orthogonal basis is constructed using linear combina- 
tions of the Krylov space states such that the hamiltonian written in this basis is tridiago- 
nal. In a standard approach, which will be described below, a basis { |/m) }, m = 0, . . . , A, 
is first constructed that is orthogonal but not normalized. These states will subsequently 
be normalized to yield an orthonormal set {\^m)}^ in which the hamiltonian takes a par- 
ticularly simple tridiagonal form. We will also discuss a slightly different approach of 
directly generating the normalized basis {|0m)}- 

Generation of the Lanczos basis. The construction of the basis {|/m)} starts from 
an arbitrary normalized state |/o), of which it is required only that it is not orthogonal 
to the ground state of H (if the goal is to find the ground state). This should be the case 
for a randomly generated |/o), but one can also start from some vector which is known 
to have a substantial overlap with the ground state. The next state is given by 

|/i)=H|/o)-flo|/o), (171) 

where the constant aq should be determined such that is orthogonal to |/o). To this 
end, we examine the overlap between the two states; 

(/il/o) = (/ol^l/o) -«o(/o|/o) = Hoo-aoNo, (172) 

where we have introduced notation for the normalization constants and diagonal matrix 
elements of the hamiltonian that will also be used for subsequent states; 

Nni = {fm\fm)j Hmm — ^fm\H-\fm)- (173) 

For the overlap (172) to vanish we must choose ao = Hqq/Nq. The following, m — 2 state 
is written in terms of the two preceding ones as 

|/2)=//|/i)-ai|/i)-^o|/o), (174) 
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where ai and bo can be chosen such that I/2) is orthogonal to both |/o) and The 
overlaps are, using //|/o) and H\fi) obtained from Eqs. (171) and (174); 

(/2I/1) =^11 -aiM, (/2I/0) =Ni-boNo. (175) 

The appropriate coefficients are thus oi = Hn/Ni and bo = Ni/Nq. For all subsequent 
iterations, Eq. (174) generalizes to 

\f^+l)=H\fm) -am\fm)-bm-\\fm-\), (176) 

and the coefficients rendering this state orthogonal to and \fm-i) are generalizations 
of the expressions we already found for ao^^^i) and bo; 

(^m~Hmm/Nfni bffi—\=NfYi/Nffi—\. (177) 

This can easily be checked by direct computation of the overlaps. It remains to be shown 
that with these coefficients the state l/m+i) is orthogonal also to all previous states \fk) 
with A: < w — 1. In an inductive proof, we can use the fact that all previously generated 
states are orthogonal to each other, and obtain, 

{fm+l\fm-k) = {fm\H\fm-k)-Clm{fm\fm-k)-bm-\{fm-\\fm-k) (178) 
= {fm\fm-k+\) + (im-k{fm\fm-k) +bm-k-\{fm\fm-k-\) = 0, 

where we also used H\f„_k) in the form given by Eq. (176). 

This iterative procedure (176) is continued until m = A, where A can be determined 
automatically, on the fly, according to some convergence criterion for computed quanti- 
ties, as we will discuss below. First we need the hamiltonian matrix elements. 

The hamiltonian in the Lanczos basis. Having constructed a set of A + 1 Lanczos 
vectors m = 0, . . . , A, the hamiltonian in this basis can be constructed. We can use 
the expression for H acting on one of the basis states, obtained from Eq. (176); 

H\fm) = \fm+\) + am\fm) +bm-\\fm-\) , (179) 

and thus the non-zero matrix elements are 

{fm-\\H\fm) = bm-lNm-l=Nm, 

{fm\H\fm) = a„Nm, (180) 
{fm+l\H\fm) = Nm+l- 



The normalized basis states are 



|0«) = -^|/^), (181) 



and bm — Nm+i/N^ according to (177). The non-zero matrix elements are therefore 



{<j)m-l\H\({)^) = \/bm-\, 

l^^yn\ii\^m) = «m, (182) 
{^rn+l\H\^m) = V^- 
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This is a tridiagonal matrix, which can be diagonalized using special methods that are 
faster than generic diagonalization algorithms (available in many linear algebra subrou- 
tine libraries). The main advantage is not, however, that the matrix can be diagonalized 
more easily — it is anyway typically not very large and even if a generic diagonalization 
routine is used the time spent on that part of the calculation is negligible. The advan- 
tage is that this matrix can be constructed relatively quickly, especially when the matrix 
H is sparse and only its non-zero elements have to be considered (stored or generated 
on the fly when acting with H on & state). Some times, when pushing the method to 
very large matrices, it is also useful that only three of the large state vectors have 
to be stored in the process. The tridiagonality is also very convenient when calculating 
spectral functions (dynamic correlations), as discussed, e.g., in Refs. [43, 143]. 

Note that with the basis states labeled m = 0, .... A, the size of the tridiagonal matrix 
(the actual basis size) is A, not A + 1, because for m -|- 1 = A in (179) the last diagonal 
matrix element generated for use in (183) is aA-i- The state |/a) does not have to be 
constructed; only the term if |/a i) is needed to calculate the last coefficient aA i. 

Since the cost of diagonalizing the Lanczos hamiltonian matrix is very low, one can 
do that after each new basis state has been generated, and follow the evolution of the 
eigenvalues. One can stop he procedure based on some suitable convergence criterion, 
e.g., the desired eigenvalues changing by less than some tolerance e between iterations. 
The basis size required for convergence of course depends on the system studied, but 
typically, for quantum spin systems, A in the range tens to hundreds should suffice. 

Alternative formulation with normalized vectors. The generation of the Lanczos 
basis is normally discussed in terms of the un-normalized states as we have done 
above. However, a direct computer implementation of this procedure occasionally leads 
to numerical problems, because the normalization constants Nm can become exceedingly 
large (if the eigenvalues of H are large). It may then be better to work directly with the 
normalized states 1 0;„) (as an alternative to multiplying Hby di suitable factor). In fact, in 
practice this formulation is even simpler, and the additional computational cost is merely 
the normalizations at each step (which is normally small compared to other costs). 

We start with a normalized state 1 0o) and generate the second state according to 



Here A'^i is a normalization constant, which is determined by direct computation of the 
scalar product of the constructed state H\^q) — ao|</'o) with itself. Orthogonality with 
|0i) again requires aq = ^00- For each following state one can easily show that 



is orthogonal to all previous states. Here the definitions of a^ and differ from the 
previous ones in Eq. (177); 




(183) 




(184) 




(^„,|//|0„), 



(185) 
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where \ym) is the generated state before normalization as in Eq. (184) above. The 
hamiltonian matrix elements are 



{(^rn-l\H\(^m) = 
{<j)m\H\<l)m) = 




(186) 



{(j)m+l\H\(j)m) = y/Nm+l- 

In this formulation all the stored numbers are well behaved. 

Degenerate states. It should be noted that the Lanczos method cannot produce more 
than one member of a multiplet; out of a degenerate set of states, only a particular linear 
combination of them will be obtained (which depends on the initial state |/o)). To see 
the reason for this, we again look at the expansion (170) of a state H^\^), in which we 
assume that there are two degenerate states 1 1//,) and | Xj/j), Ei = Ej. In the expansion we 
can isolate these states from the rest of the terms; 

H''\^)=Er{ci\xifi)+cj\xifj))+ £ c„£^|v^«). (187) 

For any A, the expansion contains the same linear combination of the states | ij/j) and 
I . Hence, in the subspace spanned by the set of states H'" |^),m = 0,...,A— 1, there 
is no freedom for obtaining different linear combinations of the two degenerate states. 
This of course generalizes also to degenerate multiplets with more than two states. 

Loss and restoration of orthogonality. When the basis size A becomes large, the 
Lanczos procedure typically suffers from numerical instabilities. Round-off errors ac- 
cumulated in the course of constructing the basis set will eventually introduce some 
non-orthogonality among the states. Such numerical errors can escalate and lead to suc- 
cessive sudden appearances (within some narrow ranges of A) of several identical eigen- 
values (recall that the Lanczos scheme should never produce degenerate states). We will 
see an example of this further below. 

Loss of orthogonality and the appearance of multiple copies of the same states is nor- 
mally not a problem when the aim is to obtain only the ground state. It can complicate 
calculations of excited states, however. To remedy this, one can add to the basis con- 
struction a step where each new Lanczos vector constructed is explicitly orthogonalized 
with respect to all previous basis vectors. In the simplest implementation of such a stabi- 
lization procedure, all the Lanczos vectors are stored (in primary memory or secondary 
storage). This can become problematic when dealing with very large basis sets, but the 
scheme is very simple. Working with the normalized states, each \^m+i) constructed 
according to (184) is orthogonalized with respect to all previous states, according to 

10.+!)^^^^^^, ?=(0d0m+l), (188) 
1 q 

successively for ? = 0, . . . , m. This makes it possible to study a much larger number of 
excited states (in principle only limited by computer memory). 
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Instead of converging several excited states in the same run, one can also target excited 
states one-by-one, starting each time from a vector which is orthogonal to all previous 
ones. Re-orthogonalization should then also be done with respect to those. 

Eigenstates and expectation values. Diagonalizing the tridiagonal Lanczos hamil- 
tonian results in eigenvalues and eigenvectors v„, n = 0,...,A — 1. We want these 
eigenvectors expressed in the original basis {|a)}, in which we are able to evaluate the 
matrix elements {b\0\a) of operators O of interest. First, the Lanczos basis states are 

M 

£<l>m(fl)|fl), m = 0,...,A-l, (189) 

a=l 

and we denote the desired eigenvectors of the hamiltonian as 

M 

k«)= E V^n(«)l«), n = 0,...,A-l. (190) 

a=\ 

The first few eigenvectors Vn of the tridiagonal matrix accurately represent eigenstates 
of the hamiltonian (essentially exactly for sufficiently large A) in the Lanczos basis; 

A-l A-1 M 

Wn) = Vn{m)\(j)m) = v„(m)0«(a) |a) , (191) 

m—O m=Oa=l 

and thus the wave function coefficients we want to construct are given by 

A-l 

V^n(«) = E ^«H0m(a), a=l,...,M. (192) 

m=0 

If all the Lanczos vectors have been stored during the basis construction, this transfor- 
mation can be carried out in a straight-forward manner. If there is enough computer 
memory available, one should store all Lanczos vectors, but often calculations are push- 
ing the limits of computer capacity, and then only the bare minimum of information can 
be stored, at the cost of longer calculation times. If we did not store the full set of Lanc- 
zos vectors during the basis construction, we need to generate them again, in the same 
iterative fashion as before, except that we already have the coefficients a,n and Nm avail- 
able (and bm if the scheme with un-normalized Lanczcos vectors \fm) is used) and do not 
need to recompute them. We can transform the states according to (192) on the run with 
the eigenvectors v^, building up one or several of the eigenstates of the hamiltonian. 

Having generated one or several eigenstates, now assumed to be stored in the form of 
the coefficients \l/m{a) in (190), an expectation value of some operator O can be obtained 
by first acting on the state, giving an un-normalized state that we call | V'^f ), 

M 

0\Vn) = \V^) = \lfnia)0\a) 

a=l 
M M 

= E Ev/«(«)l^)(^|0|fl). (193) 

a=lb=l 
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M M 

= Y.^n{a)\a), \j/^{a)=Y,¥n{b){b\0\a). 

a=l b=l 

We then evaluate the scalar product of this state with the original state | \i/n) ; 

M 

{Vn\0\\l/n) = iVnlVn) = L V^n(«) V'n^l^)- (194) 

a=l 

This somewhat cumbersome way of writing the expectation value corresponds to a typ- 
ical computational procedure of first computing the state vector y/^ and then computing 
its scalar product with \j/n. 

It is interesting to note that, in Lanczos calculations, we are often dealing with four 
different bases: First, we have the original "computational" basis of single states of f 
and i spins. From these we generate a basis incorporating symmetries, e.g., momentum 
states (which normally would be the states denoted as \a) above). We then construct 
the Lanczos vectors, which are particular linear combinations of those states. Finally, 
diagonalizing the tridiagonal matrix (which is an effective hamiltonian in the low-energy 
sector), we obtain the desired energy eigenstates. To do calculations with those states, 
we effectively do the basis transformations in reverse. The matrix elements in (193) and 
(194) are in the end carried out in the computational basis of f and | spins (i.e., with the 
representative states used, e.g., to build the momentum states). We next discuss all these 
procedures in practical program implementations. 



4.2.3. Programming the Lanczos method 

While the Lanczos method can be applied to any symmetric (or hermitian) matrix, 
in the case of a spin hamiltonian there is an added advantage in that the hamiltonian is 

a sparse matrix. Although the size of the hamiltonian (an individual symmetry-block) 
can be very large, the number of non-zero matrix elements is much smaller. For a model 
with short-range interactions on a lattice of A'^ sites, a hamiltonian block of M states has 
on the order of NM non-zero elements, which for large M is much smaller than the total 
number of elements M^. Since the most time consuming part of the construction of the 
Lanczos basis is the repeated operations with the hamiltonian matrix on a state vector, 
to generate the next basis state according to Eq. (176) or (184), the sparseness allows for 
enormous time savings. There are similarly significant memory savings advantages as 
well. The non-zero elements of the hamiltonian should then be stored in a compact form, 
or generated on the fly as needed (which, when many symmetries are used, typically 
takes longer than even reading the elements from disk storage). We here implement the 
Lanczos method with the hamiltonian stored in a compact form in primary memory. 

We will generate the Lanczos basis of states that are normalized at each step, 

using Eqs. (184) and (183). In a computer program, these states are stored in the form 
of their vector components </>m(«), in terms of which the Lanczos states are given by 
Eq. (189). Here a= 1, . . . ,M labels the states of our working basis, which in the case 
of maximal use of symmetries would be momentum or semi-momentum states, e.g., 
\a) = \k,p,z,mz = 0, a), but for simplicity we do not write out all the quantum numbers. 



Computational Studies of Quantum Spin Systems 



January 18, 2011 



89 



The nature of the basis states only come into play when acting with the hamiltonian (or 
some operator to be measured) on the basis states, and for this we need exactly the matrix 
elements that have already been discussed for the various symmetry implementations in 
Sees. 4.1.3, 4.1.4, and 4.1.5. We will write pseudocodes assuming real-valued wave- 
function coefficients; the changes needed for complex states are self-evident. 

Lanczos basis construction and eigenstates. To discuss the general structure of 
a Lanczos program, we begin by assuming that we have implemented a subroutine 
hoperation(0, 7) which acts with the hamiltonian on a state vector; I7) = H\^). Later, 
we will describe the implementation of this subroutine, which is the only part of the 
Lanczos procedure that requires a specification of the model and the symmetries imple- 
mented. We also use a subroutine normalize(0,n), which first computes n= {(j>\(j>) and 
then rescales ^ so that = 1 upon return. 

To start the basis generation, we first load the initial state |0o) with a randomly 
generated normzalized state. We denote the elements ^o{i), i — I,. . . ,M and (<^o|<^o) = 
Lj 0o(O^ = 1- The next Lanczos state is generated according to (183); 

call hoperation(0o, ^1) 

«o = (0o|^i); ^/>i = ^/>o-«o|0i) {17} 
call normalize ( 0i , n 1 ) 

If we are not storing all the Lanczos vectors, we can cycle between three vectors, 
0O!0i!027 that contain the information we need at each step when implementing the 
iterative basis generation according to (184); 

do m = 1 , A — 1 

call hoperation ( (^1 , (|)2 ) {18} 

«m=(0l|02); 02 = 02-«m01-«m0O 

call normalize ( 02 , 1 ) 

00 = 0i; 01 = 02 

enddo 

If possible, we should store all the states, to avoid having to regenerate them at the 
later stage when computing expectation values (which doubles the computation time). 
We have to store all the states if we want to carry out additional re-orthogonalization, 
to ensure that numerical truncation errors do not eventually degrade the Lanczos basis. 
In that case we insert an additional loop to orthogonalize with respect to all previously 
generated states, according to Eq. (188); 

do m = 1 , A — 1 

callhoperation(0m,0w+i) {19} 

«m = (0m|0m+l); 0m+l = 0m+l -«m0m-«m0m-l 

call normalize ( 0^+ 1 , 1 ) 

do z = 0, m 

q = (0„+i|0,-); (j)m+l = (0m+l -?^) 

enddo 
enddo 
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After completing either code segment {18} or {19}, we have the contents of the tridi- 
agonal matrix in Eq. (187) and can proceed to feed this information into a diagonal- 
ization routine which delivers the eigenvalues En and eigenvectors with elements v„(/), 
n = 0, . . . , A — 1, / = 1, . . . ,M. It should be noted again that, with the conventions we 
have been using, the Lanczos basis size (the size of the matrix) is A; the last state gener- 
ated in {18} and {19}, with m+ 1 = A, is actually not needed (only the coefficient oa-i 
generated in the last iteration is needed). 

The computational effort of the diagonalization is very small compared to the time 
spent on the basis construction. We may therefore as well diagonalize after each new 
Lanczos vector has been generated. We can then monitor how the energies evolve with 
the basis size. One can then stop when some convergence criterion is satisfied. One can, 
e.g., demand that the change in the ground state energy (or the highest excitation of 
interest) changes between steps m — 1 and m by less than some small number e. The 
Lanczos method is normally capable of converging energies to the numerical precision 
of the computer (and double precision should always be used). As we will see below, 
other properties converge slower than the energies. 

Expectation values. First, let us consider calculations of operator expectation values 
(besides the energy) in a program where we have not stored all the Lanczos basis vectors. 
Then, in order to be able to transform the ground state from the Lanczos basis to the 
original basis, where we can carry out "measurements" on the state, we have to repeat 
the Lanczos basis construction starting from the same initial state as in the first run. If 
we had initialized with a random ^q, as in code {17}, and did not save this state, we 
can re-generate it by initializing the random number generator with the same seeds as in 
the first construction. In any case, with 0o at hand, we can proceed as in code segment 
{18}, with the minor simplification that we now do not have to calculate the coefficients 
am,nm, because we already have them. We can use for normalizing, instead of calling 
the subroutine normalize in {18}. During the basis re-construction, we transform the 
states with the ground state vector vq (or v„ for an excited state /) that resulted from the 
diagonalization of the tridiagonal matrix, building up the ground state in a vector l/A^; 

call hoperatioii(<^o, <^>i); 0i = (0i -«o0o)/v^ 

VA = vo(0)(;)o + vo(l)(^i {20} 
do m = 2,A — 1 

call hoperation(0i , 02) 

02 = (02-am-101 -«m-10o)/v^ 
V/= V/ + Vo(m)02 
00 = 0i; 01 = 02 

enddo 

The elements of the vector ^ now contain the ground-state wave-function coefficients in 
whatever basis is used and implemented through hoperation. If we have stored all the 
Lanczos vectors, we of course do not need this step and can directly transform the states 
generated in code {19}. 

It is particularly easy to calculate expectation values of operators that depend only 
on the z-components of the spins and is invariant under all the symmetry operations 
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used in the basis. We then just have to weight the quantity calculated using the rep- 
resentative states with the corresponding wave-function coefficient squared (the state's 
probability). As an example, the spin-spin correlation function for all distances 

r = 0, . . . , can be obtained as: 

do fl = 1,M 

dor = 0,A^/2 {21} 
doi = 0,iV-l 
j — mod{i + r,N) 
if {sa[i\ = Sa[j]) then 

C{r) = C{r) + \l/^{a) 
else 

C{r)^C{r)-xif^{a) 
endif 
enddo 
enddo 
enddo 
C = C/4N 

Here y/^{a) is the probability of the state | a), which when symmetries are incorporated is 
represented by a number Sa- As before, bit tests can be used to determine the relative spin 
orientation, now of spins separated by r lattice spacings. Note that although the final spin 
correlation (S^S^) only depends on r = | / — j|, we still have to average over all / above, 
because we are only using the representative states (which do not by themselves, without 
acting with the symmetry operators, obey any lattice symmetries). In principle we should 
also average over both reflections and spin-inversions of the representative, but after 
the translational averaging has been done the spin correlation function is also invariant 
with respect to the other two symmetries (which is not the case for all operators). Note 
that since the Heisenberg hamiltonian is spin-rotation invariant, the correlation function 
{S^Sj_^_^) calculated here equals (S, ■ S/+r)/3. 

Calculating expectation values of z off-diagonal operators that cannot be simply re- 
lated to z diagonal ones require explicit operations on the states, and thereafter eval- 
uation of a scalar product, according to Eqs. (193) and (193). This is in principle 
easy — essentially proceeding as in the construction of the hamiltonian in the preced- 
ing sections — but more time consuming than diagonal operators. 

Compact storage of the hamiltonian. We now discuss the inner workings of the 
subroutine hoperation(0 , y) that we employed in the Lanczos procedures. It implements 
the operation 

M M 

H\^) = Ir) = L L m{b\H\a)\b). (195) 

a=\b=\ 

This is of course where all the details of the symmetries employed will enter. The main 
difference with respect to the construction of the hamiltonian in complete diagonaliza- 
tion is that we do not want to store as a full M xM matrix, because we have in mind 
calculations were the number of basis states M can be up to many millions (e.g., for 
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= 32 in Table 1). We therefore have to devise a convenient way of only storing its 
non-zero elements, of which there are of the order NM in (195). 

We define the following data structures for the compact hamiltonian storage: For each 
a= 1,...,M in (195) we store the number of non-zero matrix elements {b\H\a). 
We store the locations b of these non-zero elements as consecutive integers in a vector 
with elements B{i). In a program, it is convenient to use the elements i— l,...,ei 
for a = 1, followed by / = ei + 1 , . . . , ei + ^2 for a = 2, etc. The required size of the 
vector B (the number of non-zero elements) is initially not known, and, depending on 
the programming language used, may have to be allocated using an estimated size. We 
can store the corresponding non-zero matrix elements in a vector with floating-point 
values H{i). We can also take advantage of the fact that the hamiltonian is a symmetric 
matrix and only store one of its "triangles". 

With the above notation, carrying out the operation (195) is now as simple as: 

subroutine hoperation(0, y) 

7=0; i = 
do a = 1,M 
do 7= l,ea 
i = i+ 1 

r{B{i)) = r{B{i))+H{i)(j>{a) 
r{a)^r{a)+H{i)(j>{B{i)) 
enddo 
enddo 

The counter / keeps track of the position of the elements in the data structures H and 
B. Diagonal matrix elements, B{i) = a, are double counted in this procedure since the 
contributions from the upper and lower triangle of the hamiltonian matrix are added. It 
is therefore assumed that their stored values have been divided by 2. 

The matrix elements H{i) and their locations B{i) are generated in a way similar to 
what we did when constructing the complete hamiltonian, e.g., as in code segments 
{13}-{16} in the semi-momentum basis with parity. We also discussed the simple ex- 
tensions involving spin-inversion symmetry. Now we discuss the modifications needed 
when we wish to load only the non-zero elements into our compact storage. 

In the case of the semi-momentum basis, which we will consider here, the procedures 
are again somewhat complicated by the fact that the same representative can appear once 
or twice (c7 = ±1) in the state list, and we want to take care of these at the same time 
in order to avoid repeating tasks unnecessarily. We therefore carry out the loop over the 
basis states a and determine the number of same representatives n as in code {13}. We 
cannot put the matrix elements directly into the storage vector H, because our scheme 
in code {22} requires consecutive storage of all matrix elements for each column. With 
the way the individual bond operators in the hamiltonian are treated one-by-one in code 
{16}, when n = 2 the two columns would be mixed up if we store each matrix element 
as it is generated. We therefore use temporary storage for the one or two columns being 
currently processed, and then later copy their contents into the appropriate positions of 
the final storage. Referring to the first (a) and potential second (a+l) column as c = 1 
and c = 2, we store the data temporarily as Bc{k) and Hc{k),k=l,...,nc, where ric is 
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{22} 



the number of elements for the column in question. The diagonal elements (divided by 
2, as discussed above) are entered first into these temporary storage lists; 



ni = 0; n2 = 
do c — l,n 



{23} 



Bc{l) = a + c-l; Hc{\) = Ej2-nc=\ 



enddo 



For the off-diagonal matrix elements we proceed as in code segment {16}. Note, how- 
ever, that several bond operations may lead to the same state \b) when acting on a given 
basis state \a), but in the compact storage we do not want to store individual contribu- 
tions to the same matrix element separately. Everything is taken care of by replacing the 
loops over / and j in code {16} by this extended version; 

do / = a,a-|-n 

do 7 = max(&, /),&-!- m {24} 
c = i — a+\ 
E = helemeiit(j,7,/)?) 

do ^ = I, He 

if 5c (A:) = j exit 
enddo 

if (k > He) then ric = k; Bc{k) = j endif 

if 0" = then Hcik) = Hdk) +E/2 else Hc{k) = Hc{k)+E endif 



Here max(/, b) on the second line ensures that only the elements in matrix triangle j > i 
are constructed. The innermost loop over k checks weather there is already a stored 
contribution to the matrix element {j\H\i). It has been assumed that when the loop over 
k has been completed (without exiting before k = nc), then k takes the value ric + l (which 
is the case in many computer languages). Then A: = nc -I- 1 if there is no prior location 
j in the list, which means that it should be added to the list (and the size of the list is 
then k). We have also assumed that all Hc{k) are initially set to zero, so that each new 
contribution E can be added in the appropriate location of He. Note that although we are 
carrying out off-diagonal operations here, in the basis we are using, such operations can 
also lead to diagonal matrix elements, in which case we have to divide E by 2. 

After having generated all the matrix elements originating from the current represen- 
tative (i.e., completed the loop over all nearest-neighbor spin pairs), we copy the contents 
of the temporary storage vectors into the permanent full storage; 



Here hh is a counter for the total number of non-zero matrix elements added so far. 
After this, the loop over state labels a is closed, as in code {13}. This completes the 



enddo 
enddo 



doc= 1,2 
do / = l,nc 

nH = nH+l; B{n„) = Bc{i);H{nH) = Hc{i) 



{25} 



enddo 
enddo 
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A A 

FIGURE 30. Example of the convergence as a function of the Lanczos basis size of the energy (left) 
and the total spin (right) of the four lowest levels of the 24-site Heisenberg chain in the symmetry sector 
k — Q,p — l,z— 1. The spin S is extracted using the assumption that (S^) — S{S+ 1). 



construction of the hamiltonian. 

For other operators, it is not worth storing the matrix elements, because normally we 
do not reuse operators for observables many times. The hamiltonian is used repeatedly, 
however, up to a few hundred times, and so using code segment {21} instead of having 
to carry out all the operations associated with extracting each individual hamiltonian 
matrix element from scratch every times can amount to a very significant speed-up. 

It is possible to further compactify the hamiltonian by not storing the matrix elements 
as double-precision numbers, but instead use a mapping to the actual numbers based on 
a table of integers. The number of unique values is often very small (tens or hundreds of 
values, so that one can even use "short" integers as pointers to the actual values), and this 
can save some memory, at the cost of a somewhat more time consuming construction of 
the hamiltonian. The locations B{i) have to be stored as four-byte integers, however. 

When pushing the limits of the largest treatable system sizes, one may have to store the 
hamiltonian on disk (and reading successive portions of it when executing code {22}), 
or generate it on the fly without storing it. The latter essentially amounts to executing 
code based on {24} every time when acting with the hamiltonian. 



4.2.4. Convergence of Lanczos calculations 

The Lanczos method is essentially exact if a sufficiently large number of Lanczos 
vectors A is used, and typically this number does not even have to be very large; on 
the order of a few tens to hundreds. The convergence should be checked by carrying 
out calculations for several A, until no changes can be detected in the energies and 
expectation values of interest. The ground state converges the fastest, and energies 
converge faster than expectation values. As an example. Fig. 30 shows results for an 
N = 24 chain. The energy and the total spin of the four lowest levels in the symmetry 
sector of the ground state are shown versus A. The spin quantum number is calculated 
by acting with the squared total spin operator on the states, according to Eqs. (193) 
and (194), and extracting S assuming = 5(5+ 1) (which is valid only when the 
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FIGURE 31. (a) The four lowest energies as a function of the Lanczos basis size for a 16-site Heisen- 
berg chain with quantum numbers (k = 0,p = l,z — 1). Muhiple copies of the same state appear suc- 
cessively due to loss of orthogonality, (b) The five lowest states of the same system obtained with re- 
orthogonalization of the basis set. 



States have converged to eigenstates of the operator). The energies are seen to converge 
monotonically, whereas this is not necessarily the case for other quantities, as seen 
clearly for the 5 = 2 states in this case. The details of the convergence of course depend 
on the initial state from which the Lanczos basis is constructed (which in this case was 
a random state). In this case all the four levels shown (E as well as 5) were converged to 
better than 10 decimal places at A = 60, with the ground state having converged at that 
level already at A = 30. Going to larger system sizes, the convergence becomes a little 
slower, but for this particular model there are no difficulties in converging several levels 
up to the largest system sizes that can feasibly be studied. 

One can accelerate the convergence of a Lanczos calculation by starting from a 
state which is already close to the ground state. Such states may be constructed in a 
number of ways, e.g., based on some approximate analytical method. But if there are no 
convergence problems this may not be worth the additional effort. However, if a series 
of calculations are carried out as a function of some parameter in the hamiltonian, then 
subsequent calculations can be started from the ground state of the preceding parameter 
value, which is likely to have a significant overlap with the next ground state. However, 
it should be noted that if the initial state is a good approximation to the ground state, 
it will have very small overlaps with the first few excited states, and hence only the 
ground state is likely to converge rapidly in such a calculation. If excited states are also 
needed, this problem can be circumvented by starting the next calculations using a linear 
combination of eigenstates from prior calculations. 

Loss and recovery of orthogonality. The Lanczos basis vectors should all be com- 
pletely orthogonal to each other, but numerical truncation errors build up and eventually 
lead to escalating loss of orthogonality for some A. This manifests itself as artificial de- 
generacies, with excited states "falling down" onto lower states. An example of this is 
shown in Fig. 31(a), where the four lowest energies of a 16-site chain in the ground-state 
symmetry sector are graphed versus A. The higher energies are seen to successively col- 
lapse onto the immediately lower energies, with only a few iterations taken for the levels 
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to become completely degenerate after signs first appear of the imminent collapse. The 
basis then includes multiple copies of the same states. 

The loss of orthogonality may at first sight seem like a serious problem, but in practice 
this is no necessarily the case. Calculations for the ground state are not much affected 
by there being more than one copy of it, as long as one makes sure that it is properly 
normalized before using it to calculate expectation values, and the first excited states are 
normally (but not always) well converged before they fall down onto lower states. 

Loss of orthogonality often occurs sooner for small systems than large ones. This is 
related to the fact that there are less low-energy states for small systems, whence the 
lowest states in the Lanczos basis can become "overcomplete" in this subspace (if the 
Lanczos basis is larger than the total number of states, it clearly is truly overcomplete 
and the calculation will not work). 

One can easily supplement the Lanczos method by an explicit re-orthogonalization 
step, as in Eq. (188) and implemented in code {19}. The drawback is that this requires 
storage of all the Lanczos basis vectors, which may not be possible for large systems. 
The procedure may also become time consuming if the number of states is large. As 
shown in Fig. 3 1(b), re-orthogonalization enables convergence of many more eigenstates 
(limited only by memory and time constraints). 



4.3. ID states and quantum phase transitions 

In this section we discuss several related ID calculations using the Lanczos method 
and finite-size scaling methods for up to = 32 spins. First we investigate the critical 
ground state of the standard Heisenberg chain, and also analyze some properties of 
excited states. Then we introduce frustration, studying the dimerization transition in the 
J1-J2 chain. Finally, we add long-range interactions in addition to frustration, in which 
case the continuous dimerization transition evolves into a first-order transition between 
a Neel state (which is possible even in a ID system if the interactions are sufficiently 
long-ranged) and a dimerized VBS state. 



4.3.1. Ground state and excitations of the Heisenberg chain 

Although the Heisenberg chain has an exact Bethe ansatz solution [51], the wave 
function is very complicated and in many cases numerical calculations for finite-size 
systems have to be used to extract information on physical properties from it [53, 54]. 
The energy of the ground state [52] and the low-lying excitations [149] can be calcu- 
lated exactly both for finite chains and in the thermodynamic limit, however. Some other 
quantities can also be extracted for very large chains [150]. In addition to using the exact 
solution, many properties of this class of system (i.e., with a wider range of interactions 
maintaining the symmetries of the system) are known based on an asymptotically exact 
low-energy field-theory description; the Weiss-Zumino-Witten non-linear a model with 
topological coupling (or central charge) c — I [56]. Equivalently, the Heisenberg chain 
also represents a special case (because of its spin-rotational invariance) of the Luttinger- 
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FIGURE 32. Dispersion of the lowest excitations, relative to the ground state, in the sectors with spin- 
inversion quantum number z = 1 (giving low-energy states with spin 5 = or 2) and z = — 1 (which always 
gives 5=1 for the lowest state) for an = 32 chain. The solid curve below the data points is the exact 
infinite-size dispersion derived by des Cloizeaux and Person [149] using the Bethe ansatz. 



liquid State, which describes a broad range of interacting ID spin, fermion, and boson 
systems [151, 152]. Numerical studies based on exact diagonalization and other unbi- 
ased finite-lattice techniques (such as QMC and DMRG) have played an important role 
in guiding and confirming these theories (see, e.g., [153] and [7]). Computational studies 
are also required for extracting non-universal (short-distance, higher-energy) properties 
that are not captured by universal continuum field theories (including quantities directly 
accessible to experiments, as discussed in, e.g, Refs. [154, 155]). Numerical results for 
the Heisenberg chain are also very important for enabling rigorous bench-mark tests of 
extrapolation techniques when studying other spin models, for which less is known from 
analytical calculations. 

Ground state and low-energy excitations. For a Heisenberg chain of size = 4-n, 
the ground state has momentum k = 0, parity p = I, and spin-inversion number z = 1 ; it 
is a fully symmetric singlet state. For N = 4n + 2 the lowest-energy state is in instead in 
the completely antisymmetric sector; k = n,p = —l,z — — l,S = 0. 

Let us look at the most important excitations of the Heisenberg chain. For k not equal 
to the ground-state momentum, the lowest-energy state of a finite system is a triplet; 
hence z = — 1 and +1 for chains of size N = An and An + 2, respectively. Fig. 32 shows 
the momentum dependence of the excitation energies (o{k) = Es{k) — Eq for both the 
lowest z = +1 and z = — 1 states of a 32-site chain. The lowest z = +1 state always 
has 5=1, while the lowest z = — 1 state has either 5 = or 5 = 2. The triplet energies 
are quite close to the exact N = oo triplet dispersion obtained from the Bethe ansatz 
[149], COs{k) = 7r\ sin(^)|/2, especially close to ^ = 0. Close to k = k the deviations are 
larger. The 5 = 0, 2 excitations are mostly slightly higher in energy, except that the 5 = 2 
state close to fc/;r = 0.85 is actually marginally lower than the triplet (likely a small-A'^ 
anomaly), and at fc/;r = 2/A^ it is much higher. 

For large N oo, one would expect the singlet and triplet excitations to become de- 
generate with dispersion ft)D,i(^) = ^| sin(^)|/2. This can be understood as originating 
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FIGURE 33. Singlet and triplet gaps of the Heisenberg chain versus the inverse system size. The inset 
shows the gaps multiplied by A^, illustrating the presence of multiplicative logarithmic corrections to the 
dominant ^ \/N scaling (dynamic exponents =1). 



from excitations of pairs of spin- 1/2 soliton-like degrees of freedom called spinous. 
These spinons are very weakly interacting and in the infinite chain behave as indepen- 
dent particles, hence forming four degenerate levels (from which 5 = and 1 states can 
be formed). The dispersion graphed in Fig. 32 is only the lowest edge of a continuum of 
spinon excitations, which can be calculated in detail using the Bethe ansatz [53] and has 
also been observed experimentally in quasi-lD antiferromagnets [156]. 

The very lowest excitation for ^lN — An chain is a triplet atk= n, with p = — l,z = — 1, 
whereas for 4n + 2 the lowest triplet has fc = 0,p = l,z=l (i.e., the difference in 
momentum with respect to the ground state is always n, and p,z are minus their ground- 
state values). Fig. 33 shows the finite-size scaling of these lowest triplet and singlet 
excitation energies (the singlet and triplet finite-size gaps) versus the inverse system size. 
They are both seen to scale to zero as l/N, corresponding to a dynamic exponent z = 1 . 
There is a weak correction to this form originating from logarithmic corrections, which 
can be seen clearly when plotting the gaps multiplied by (in the inset of Fig. 33). 
This gap scaling has been predicted in detail based on the the continuum field theory 
approaches [57, 153]. The prefactor of the gap scaling is directly related to the velocity 
of the spinon excitations, but it is not easy to extract it reliably based on the finite-size 
data because of the log corrections. The velocity can also be extracted from the linear 
parts of the dispersion relation close tok = and n, (Os{k)/k or (Os{k) /{n — k), with the 
k = behavior being the easier to analyze, as seen clearly in Fig. 32. 

Spin correlation function. The spin correlation function is one of the key character- 
istics of of any quantum spin system. The correlations in the ground state are of primary 
interest. For a spin-isotropic and translationally invariant chain we can write it as 



for any reference site As we discussed in Sec. 2, the Mermin-Wagner theorem rules 
out antiferromagnetic long-range order in the Heisenberg chain. Instead, in the ground 



C(r) = (SrS,+,)=3(5^-5^-+,), 



(196) 
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FIGURE 34. Absolute value of the spin correlation function, multiplied by the separation r, in the 
ground state of the Heisenberg chain. Results are shown as a function of distance for different system 
sizes, and also at r — N/2 versus the chain length A^. 



State C(r) decays as {—ly/r for large r, up to a multiplicative logarithmic correction 
(originating from a marginally operator in the field-theoretical description [57, 58, 59]). 

Fig. 34 shows ground-state results for C(r) as a function of r for system sizes A'^ = 
16, 24, and 32. The results have been multiplied by r, so that an r-independent behavior 
should obtain for large r if the asymptotic form is ~ 1 /r. It is also useful to multiply by 
( — l)*", in order to cancel out the ± oscillations of the staggered phases. Note that there 
are remaining even-odd oscillations in C(r)(— 1)''. Such oscillations are quite common 
for various correlation functions of ID systems. They diminish with increasing system 
size but are still strong for the small system studied here. The form of the oscillations is 
also predicted by the Luttinger liquid theory; they decay as 1/r^ [152]. 

In Fig. 34, clear deviations from the 1/r form of C(r) can be seen which are 
mainly due to the periodic boundary conditions, which enhance the correlations close 
to r = N/2. Open boundaries cause even more severe finite-size effects (in addition to 
not allowing the use of translational symmetry for block-diagonalization). The logarith- 
mic corrections should play some role as well. As the chain length is increased, the 
correlation function at fixed r converges to the infinite-A'^ limiting form. However, as can 
be seen in the figure, the maximum r for which the results are approximately converged 
is a rather small fraction of A^. For A^^ = 32, one could safely say that C(r) is converged 
for A'^ only up to r ^ 4, which is not enough to say much about the long-distance behav- 
ior. An alternative, also shown in the figure, is to investigate the correlation function at 
fixed r/N as a function of A^, with r/N = 1/2 the most natural choice for checking the 
long-distance behavior. Even though periodic boundaries enhance the correlations sig- 
nificantly at this point, the functional form of C(r = N/2) is proportional to the infinite- 
size C(r) [although the overall prefactor of function C(r = A^/2) versus A^ will clearly be 
different from that in the actual N oo converged C(r)]. The leading 1/r form appears 
quite plausible based on the data in Fig. 34, and the remaining enhancement is consistent 
with the predicted logarithmic corrections. In Sec. 5.3.1 we will further investigate the 
long-distance correlations based on quantum Monte Carlo calculations for much longer 
chains. 
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4.3.2. Frustration-driven quantum phase transition 



A very interesting aspect of the 5 = 1 /2 Heisenberg chain is that it exhibits several 
ground-state phase transitions (quantum phase transitions) when a next-nearest-neighbor 
interaction is added to the hamiltonian; 



This model goes under the name of the J\-J2 chain, or the Majumdar-Ghosh model (after 

the authors of the first comprehensive study of the system [60]). Here we will discuss 
the most important (and most well understood) transition, which occurs when both Ji 
and J2 are antiferromagnetic (positive). For convenience we define the ratio g — Ji/Ji. 

For g < gc, gc ~ 0.2411 [157], the system is in the same phase as the Heisenberg 
chain with 72 = discussed in the previous section. The ground state is critical with 
antiferromagnetic correlations decaying as 1/r and the finite-size gaps scale as 1/A'' (up 
to log corrections in both cases). Only the prefactors (e.g., the velocity of the excitations) 
and the strength of the log corrections change as a function of g. 

For g > gc the system is in a completely different kind of state, with exponentially de- 
caying spin correlations and a triplet excitation gap which remains finite in the thermo- 
dynamic limit. The ground state has long-range dimer order (a simpler, ID version of the 
2D VBS states discussed in Sec. 2.2). The nearest-neighbor bond strengths (the Ji energy 
contribution), (5,) = (S,- ■ S/+i), are of the alternating (period two) form 5,- = 5 + 5(— 1)' 
in the symmetry-broken infinite- size state (of which there are two degenerate ones, with 
oscillations out-of -phase relative to each other). Unlike magnetic order, this kind of or- 
der is allowed at zero temperature in one dimension because the broken symmetry (the 
lattice translational symmetry) is discrete. The modulation 5 becomes non-zero at ^ = 
and thereafter increases with g. The spin correlations are initially staggered (peaked at 
momentum k — nm reciprocal space), but dXg^ 0.52 the change to A: = ;r/2 [158], and 
for ^ > 1 the peak- value may change continuously with g (a spiral state) [159]. 

The model with ferromagnetic (negative) 7i is also interesting. There are several 
transitions between states with different periodicities [160]). Here we will consider 
exclusively the antiferromagnetic case, in the regime g <l. We will use Lanczos results 
to investigate the phase transition into the VBS state, and also discuss the properties of 
this ordered state. 

The Majumdar-Ghosh point. Before we discuss the Lanczos results, it is wort noting 
that the existence of VBS order can be shown exactly at the special point g = 1/2 (the 
Majumdar-Ghosh point), where the ground state is very simple [60]. On a ring with even 
A^, the ground state is a two-fold degenerate singlet product, as illustrated in Fig. 35. One 
can demonstrate this rather easily by just acting on the states with the hamiltonian, to 
show that they are eigenstates (while the proof of them being the lowest states is more 
involved [161, 162, 163]). Out of these degenerate states, |^^) and \^b), which break 
the translational symmetry, one can form symmetric and anti-symmetric states, which 
have momentum k = and tt, respectively: 




(197) 



(198) 
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FIGURE 35. Illustration of the degenerate ground states at the Majumdar-Ghosh point. The thick arcs 
illustrate singlets; the product of these singlets on alternating bonds is an exact ground state al g ~ 1/2. 



These are the states obtaining in Lanczos calculations with conserved momentum. Thus, 
the order parameter (5,) = Ei is featureless when calculated on finite periodic systems. 
As always in exact finite-lattice calculations, the symmetry-breaking has to be observed 
by calculating correlation functions (unless we add some perturbation that breaks the 
symmetry between the two possible ordering patterns, which has its own complications 
as the limit of vanishing perturbation has to be taken). The ground state away from the 
point g = 1/2 is more complicated than than the simple singlet-products in Fig. 35, 
with fluctuations in the singlet pairings, but with remaining alternating higher and lower 
density of singlets on the nearest-neighbor bonds. The translationally invariant k = 
and n states on finite rings correspond approximately to symmetric and anti-symmetric 
combinations as in Eq. (198). It is, however, only at ^ = 1/2 that these two states are 
exactly degenerate for finite N. In other cases, the states become degenerate only in the 
limit N o°. 

Dimer order parameter We already discussed VBS ground states in Sec. 2.2 and 
an example of how to characterize this kind of order by dimer (or bond) correlations 
in Sec. 2.4.3. We now investigate the dimer correlation function introduced in Eq. (25), 
written explicitly for a ID system as 

D{r) = {BiBi^r) = ((SrS,-+i)(S,+,-S,-+i+,)). (199) 

This function should alternative between "weak" and "strong" values for large r if there 
is VBS order (even if the symmetry is not broken). The calculation of this correlation 
function with the Lanczos method can be simplified by taking advantage of the rotational 
symmetry and compute ((S,- ■ Si^\)S\^^S\^^^^) , which is 1/3 of D{r). In principle, we 
could make it even simpler by defining the order parameter solely in terms of the z 
components, (5j5?^j5'?,_^5'?^j^^), which is not just a constant times (199) but still a valid 
order parameter for a dimerized state. Here we consider the full D{r) . 

Fig. 36(a) shows results for a 32-site system at three different couplings. At ^ = 0, the 
standard Heisenberg model, the correlations decay rapidly with r and are very small at 
r = N /2. At the transition point, gc ~ 0.241 1 (which we will determine more precisely 
below), the correlations are clearly stronger. The Luttinger- liquid theory applied to spin 
chains predicts dimer correlations decaying as l/rioxg< gc (with stronger logarithmic 
corrections when g < gc) [152]. Going to larger coupling ratios. Fig. 36(a) shows 
correlations clearly indicative of long-range order at g = 0.4. To confirm the presence 
of long-range order, a finite-size scaling analysis has to be carried out. As we did with 
the spin correlation functions of the Heisenberg chain in the preceding section, it is 
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then typically best to look at the correlations at the longest distance, r = N /I, versus the 
system size. In the case of the dimer correlations, where the "bare" correlation function is 
non-zero even in the disordered phase, one has to subtract off the constant corresponding 
to this background. One possibility is to subtract (5/)^, which is the value to which D{r) 
should converge for large r. Another option, which we will choose here, is to instead 
use the difference D{N/2) — D{N/2 — 1). This quantity will be non-zero for N ^ oo 
only if D{r) oscillates, i.e., if there is long-range dimer order. Results for three different 
system sizes are shown versus the coupling ratio in Fig. 36(b). Here the order parameter 
appears to be well converged in the range 0.4 < g < 0.6 confirming that there is indeed 
long-range order. 

Finite-size extrapolations of spin and dimer correlations. To extract the infinite- 
size dimer order parameter quantitatively, it is in most cases necessary to perform an 
extrapolation, unless the system size is sufficiently large for there to be no remaining 
size-dependence of significance [which based on Fig. 36(b) is the case close to ^ = 1/2]. 
Fig. 37 shows results for both dimer and spin correlations versus the inverse system size 
for representative g values both inside and outside the VBS phase. 

The dimer correlations, shown in the right panel of Fig. 37, are of the 1/r form for 
g < gc. There are logarithmic corrections, which are small exactly at gc. For g > gc, 
inside the VBS phase, the correlations extrapolate to a non-zero value. The asymptotic 
^ oo convergence should be exponential in this case (as can be demonstrated explicitly 
at g = 1/2) but close to the transition it is in practice not possible to reach system sizes 
sufficiently large to observe this behavior. Instead, closer to gc the behavior appears to be 
essentially linear in 1/A^, as seen in the figure at ^ = 0.4. However, at g — 0.45 one can 
see that the behavior is actually non-monotonic, with the large-A'^ behavior consistent 
with an approach to the infinite-A'^ value from below, as at ^ = 1/2. As seen in the right 
panel of Fig. 36, for ^ > 0.6 the dimer order parameter decreases with g. Larger systems 
are required for proper extrapolations in this case, and other methods have to be used. It 
is believed that the system remains dimerized for all g > gc [158, 159]. 

The spin correlations should change from 1 /r to exponentially decaying as the VBS 
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FIGURE 37. Spin (left panel) and bond (right panel) correlations at the longest distance in periodic 
chains versus the inverse system size. The couplings ratios for the five curves are g = 0,gc, 0.40, 0.45, 0.50, 
in the order top to bottom (the left panel) and bottom to top (right panel), i.e., the three sets shown with 
black circles correspond to the VBS phase. 



phase is entered. To detect this change, the correlation function C(r = N /2) is multiplied 
by in the left panel of Fig. 37. As we saw before, for ^ = there is a log-correction 
which enhances the long-distance spin correlations. At ^ = gc these corrections are much 
smaller — it is known that the leading log corrections vanish at the dimerization transition 
[164] — although the behavior is still not purely 1/r (which can be expected only in the 
limit N — > oo). For g inside the VBS phase, rC{r) decays to zero, as expected. Close to 
gc one of course has to go to large system sizes — larger than the spin correlation length 
(which diverges at gc) — in order to observe a pure exponential fall-off. 

For the ^-values used in Fig. 37, the spin correlations are staggered, i.e., peaked at 
k = TZ'm reciprocal space. For g ^ 0.52 the peak position changes rapidly to Tz/2 [158], 
and for g > \ the spin structure most likely evolves into a spiral with continuously 
varying pitch [159] (as is the case for the classical version of the model). The correlations 
always decay exponentially, however (unlike the classical long-range spiral). 

Determining the dimerization transition point. The dimerization transition is known 
to be similar to the Kosterliz-Thouless transition of the classical 2D XY model. Unlike 
a conventional phase transition, the order parameter does not follow a power law at 
gc, but is exponentially small close to gc- It is therefore not possible to extract the 
infinite-A'' order parameter close to gc based on the small systems accessible to Lanczos 
calculations — due to the cross-over behavior it is even difficult relatively deep inside 
the VBS phase. It would then appear to be very difficult to determine the location of 
the dimerization transition. There is, however, a very elegant way to extract the critical 
coupling in an indirect way, based on excited-state energies [157]. 

As we saw in Sec. 4.3.1, the lowest excited state of the pure Heisenberg chain {g = 0) 
is a triplet. On the other hand, we also know that in the VBS state the ground-state should 
be two-fold degenerate, and both these ground states must be singlets. At the exactly 
solvable point g = 1/2, the degeneracy is exact for any A'^, but away from this special 
point the two states become degenerate only in the infinite-A^ limits. The approach to 
degeneracy should be exponential in N (as will be illustrated with data below), whereas 
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FIGURE 38. (a) The three lowest energies for a 16-site chain as a function of the coupHng ratio The 
crossing point gcmss of the singlet and triplet levels can be used as a size-dependent definition for the 
critical (dimerization) coupling g^^. The vertical line indicates the exactly solvable Majumdar-Ghosh point, 
at which the singlet states are degenerate, (b) The crossing point versus 1 /A^^. The curve is a polynomial 
fit in (without a o= term, as the leading correction is o= 



in the case of the triplet excitation of the Heisenberg chain the gap closes as Xj'N , and the 
lowest singlet also approaches the ground state as 1 jN (as shown in Fig. 33). This being 
the case, for fixed N there must be some coupling ratio Across (A'^) at which the singlet 
and triplet excitations cross each other. This point can be taken as a size-dependent 
definition of the transition point, and as — t- oo it should approach the actual critical 
coupling; gcmssiN oo) = g^. 

Fig. 38(a) shows the three energy levels of interest for a 16-site chain, with the 
corresponding quantum numbers indicated as well (these quantum numbers apply to 
all system sizes A'^ = An). Note how the two singlets become degenerate as ^ —t- 1/2. As 
expected, the singlet and triplet excited levels cross; for this system size dX g'^ 0.242. 
The crossing points converge very rapidly as a function of A'^, as shown in Fig. 38(b) — the 
leading corrections are known to be proportional to l/N^ [164]. Based on these results 
it is possible to determine gc very precisely. Fitting a high-order polynomial to crossing 
points for 8 < < 32 gives gc = 0.2411674(2), where (2) indicates the uncertainty in 
the last digit based on fluctuations in the extrapolated value when different ranges of 
system sizes are included in the fits and the order of the polynomial is varied. 

Finite-size gaps in the VBS phase. As already mentioned, to accommodate symme- 
try breaking in the thermodynamic limit, one would expect the singlet-singlet finite-size 
gap to close exponentially fast with increasing system size inside the VBS phase (with 
the exception of the Majumdar-Ghosh point, where the degeneracy is exact for any size). 
The exponential gap scaling is demonstrated in the left panel of Fig. 39 for some repre- 
sentative values of g. At gc the behavior is instead oc \/N, which is not clearly seen in 
this lin-log plot, but more clearly in the log-log plot in the right panel. In the right panel 
triplet gaps are also shown versus 1/A^. Here extrapolations to non-zero values inside the 
VBS phase are apparent. The finite gap essentially corresponds to the energy required to 
promote a singlet bond into a triplet. Note that only the ground state is exactly solvable 
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FIGURE 39. Singlet (left panel) and triplet (right panel) gaps as a function of system size for different 
coupling ratios. Here Across is the size-dependent coupling at which the lowest singlet and triplet excita- 
tions are degenerate (and thus Aj = A,). The gap at Across decays as 1 /N (as seen more clearly in the right 
panel). In the left panel a log scale for A, versus is used to show the exponentially rapid closing of the 
gap between the two singlet states that become degenerate for ^ oo in the ordered VBS phase. To the 
right, the triplet gap is plotted versus 1 /N to show its convergence to a non-zero value in the VBS phase. 



at the Majumdar-Ghosh point, and the triplet gap is size dependent also here. At gc the 
gaps scale as 1 /N — in the figure the behavior at the size-dependent crossing points Across 
is shown and also scales as 1/A^. 



4.3.3. Chains with long-range interactions 

While long-range spin ordering is not possible in Heisenberg chains with finite-range 
interactions, long-range interactions make magnetic order possible at T = 0. A transition 
between a Neel state and a quasi-long-range-ordered (QLRO) state (the Heisenberg 
critical state with spin correlations decaying as 1/r, discussed in the preceding sections) 
takes place in a system with distance-dependent couplings of the form Jr<^ {—ly^^/r" 
[72]. The signs here favor antiferromagnetic order, and there is no frustration. When 
the exponent a < otc ~ 2, the ground state has true long-range Neel order, while for 
a > ttc the system is in the QLRO phase. The critical value of the long-range interaction 
parameter Uc depends on details of the couplings, e.g., on the nearest-neighbor coupling 
7i when all other J,- are fixed, and the critical exponents at the transition are continuously 
varying (in contrast to the constant exponents throughout the QLRO phase). 

Another interesting example of a system with long-range interactions is the Haldane- 
Shastry chain [165, 166], with frustrated interactions Jr = l/r^. It has a critical ground 
state similar to that of the standard Heisenberg chain, but, in field-theory language, the 
marginal operator causing the leading log-corrections vanishes [167]. The system is, 
thus, right at a dimerization transition such as the one discussed above for the J1-J2 
chain. It can be noted that J2/J1 = 1/4 in the Haldane-Shastry model, which is quite 
close to the critical ratio gc = 0.2411 of the J1-J2 chain. Thus, the interactions beyond 
distance r = 2 in the Haldane-Shastry chain only play a minor role (but, importantly, 
actually make the system exactly solvable [165]). 
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FIGURE 40. Ground state phases of the chain with frustration parameter g and exponent a in the long- 
range interaction, reproduced from Ref. [73]. The dashed curves indicate continuous phase transitions, 
whereas the thick solid curve represents a first-order transition. The thin solid curve for g <Q corresponds 
to the system with unfrustrated interaction studied in [72]. Ata^^ =0 (the J1-J2 chain) the dominant spin 
correlations in the VBS state change from <7 = ;r to n/2 at g Ri 0.52 [158]. The curve where this change 
occurs for a^' > connects to the multi-critical point where all the phase boundaries come together. 



In the presence of long-range interactions one can also realize a direct ID quantum 
phase transition between a Neel state and a VBS. Here we discuss the system introduced 
in Ref. [73], which combines unfrustrated long-range interactions and short-range frus- 
tration (at separation r = 2) according to the hamiltonian 

N/2 N 

H= Y,JrY.SrS,+r, (200) 

r=l !=1 

where the distance dependence of the couplings is given by 

/i = ^, J2 = g, 7.>2 = -^^, J^=^ + L^- (201) 

Here, with the normalization by 7£ of all couplings but J2, the sum of all non-frustrated 
interactions \Jr\ equals 1, and the limit a — )• 00 therefore corresponds exactly to the Ji- 
J2 chain with g = Ji/Ji - This will be useful when comparing the two models and also 
guarantees a finite energy per spin for N ^ °o, even for a < 1 . Instead of summing Jr 
up to r = N/2, one could also include N/2 < r < N. This should not affect the phase 
boundaries and critical exponents for a > 1 , however. 

We will study the evolution if the dimerization transition occurring as a function of 
the frustration strength g as the inverse of the long-distance interaction exponent 
is increased from 0. As shown in the semi-quantitative phase diagram in Fig. 40 (con- 
structed on the basis of Lanczos results, as will be discussed below) this continuous 
transition persists until a ^ 2, while for smaller a it evolves into a first-order transition 
between the Neel state and a state with coexisting VBS order and critical (or possibly 
long-ranged) spin correlations at wave-number q = Tt/2. This state is denoted in the 
phase diagram as VBS-i-QLR0(;r/2). 

Evolution of the dimerization transition. We have discussed how to study the dimer- 
ization transition based on crossings of excited-state energies for the J1-J2 chain, with 
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FIGURE 41. Low-energy levels of a 16-spin system at two values of the long-range exponent; a = 4 
(left) and a — I (right). The spin S and the momentum k of the states are indicated in the left panel (and 
note that two levels are shown with S — Q,k = Q). The inset in the right panel shows the avoided level 
crossing of the two k = singlets in greater detail. There is a very small gap between these singlets where 
they come together, but this is much smaller than what can be resolved in this figure. 



an illustration in Fig. 38(a). The same physics applies in the presence of the long-range 
interaction as well, if a is sufficiently large. This is shown for a 16-spin chain at a = 4 
in the left panel of Fig. 41. The lowest k = and k = k singlets should become degen- 
erate in the VBS phase forN^°° (so that a symmetry-broken dimerized states can be 
formed). A region of very near degeneracy for g > 1/2 can be seen in the figure. The 
region of approximate degeneracy, which is not easy to demarcate precisely, expands 
very slowly toward smaller g with increasing N. As shown in Fig. 42, the singlet-triplet 
crossing point converges with the system size [although slower than the l/N^ conver- 
gence shown for J1-J2 the chain in Fig. 38(b)] and can reliably give the QLRO(;r)-VBS 
phase boundary gc{(x) for a > 2. 

Another interesting feature of the energy levels is that, upon decreasing a below ^ 2, 
the broad maximum in the ground state energy versus g becomes increasingly sharp. As 
seen in the right panel of Fig. 41, at Ct = 1 it has developed into a sharp tip due to an 
avoided level crossing with the second singlet at = 0. The real singlet-triplet crossing 
has moved to the same region. An avoided level crossing between two states with the 
same quantum numbers, leading to a discontinuity in the derivative of the ground state 
energy with respect to g for N ^ is the hall-mark of a first-order transition. It should 
be noted that it is not just the two lowest singlets that exhibit this kind of avoided level 
crossing. Other low-energy states as well come together in the neighborhood of the 
transition, in a complicated cascade of level crossings — all of these should converge 
to a single point at the first-order transition when N ^ °°. The nature of the phases at 
this transition will be discussed below. First, let us investigate in more detail how the 
dimerization transition evolves from continuous to first-order. 

Fig. 42 shows the size dependence of the level crossing Across and the location gpeak 
of the maximum in the ground state energy. In the standard Majumdar-Ghosh frustrated 
chain the size correction to the crossing point is 1 /N'^ [as seen in Fig. 38(b)], which 
also can be seen for large a. For smaller a, the corrections instead seem to be 1/A^, 
but a cross-over to l/N^ for large seems likely as long as the transition remains 
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FIGURE 42. Dependence on the inverse chain length of the singlet-triplet crossing point Across and the 
location gpeak of the ground state energy maximum for different long-range interaction exponents a. The 
two lines show extrapolations of the g = 2.0 data to = oo. 



continuous. The peak location moves in the opposite direction. For some a and — )• oo, 
Across and ^peak should coincide. The results indicate that both Across and ^peak have 
dominant 1 /N corrections at this point. Line fits are shown in Fig. 42 at a = 2, were 
there is still a small gap between the two extrapolated values (and the extrapolations 
may not be completely accurate, if the linear scaling holds asymptotically only at the 
special a- value for which Across = ^peak)- At a = 1.7, where the transition is first-order, 
they should coincide (and then the asymptotic size correction should be exponential). 

First-order Neel-VBS transition. To confirm an avoided level crossing with a dis- 
continuous energy derivative for a > 1.8 (approximately), the second derivative of the 
ground state energy at its maximum is graphed on a lin-log scale in Fig. 43. These results 
were obtained based on calculations of the energy on a very dense grid of points close 
to the peak value, to which a polynomial could be reliably fitted. The second derivative 
extracted from this polynomial grows exponentially with N for a = 1.5, showing that 
the slope of the energy curve indeed changes discontinuously for an infinite chain. In 
contrast, at a = 3 the second derivative decreases for large A'^. For a = 2 convergence to 
a finite value also seems plausible, whereas a = 1 .7 and 1.8 appear to be close to a sepa- 
ratrix (where the form of the divergence is consistent with a power law) between the two 
different behaviors. This analysis suggests that the continuous dimerization transition 
changes smoothly into a first-order transition at {gi„ ^ 0.41, a,„ ^ 1.8). The singlet- 
triplet crossing moves toward the ground-state energy maximum and coincides with it at 
the multi-critical point (g^, «„), beyond which it develops into a first-order singularity. 

To analyze the states involved in the first-order transition, we next investigate the 
standard spin correlation function C(r) and the dimer correlation function D{r) defined 
in (199). We will also study the Fourier transforms of these correlation functions; the 
static spin (s) and dimer (d) structure factors: 



^72-1 



S,{q)=C{0)+C{N/2)+2 £ cos(^r)C(r), 



(202) 



r=l 
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N/2-\ 

Sj{q)=D{0)+D{N/2) + 2 £ cos{qr)D{r). (203) 

If there is long-range order at some wave- vector q = Q, then the corresponding structure 
factor is proportional to A'^ (for large A^), and order parameters can therefore be defined 
as Ss^d{Q)/N. Here 2 = ;r for both the Neel state and the dimerized VBS state. Fig. 44 
shows the g-dependence ofSd{7r)/N and 5.5(;r)/A^ at a = 1 .5. Discontinuities are seen to 
develop at g ^ 0.42, in agreement with the first-order scenario (and at g = 1, not shown 
here, the discontinuities are much sharper). Both the Neel and VBS order parameters are 
well converged to non-zero values in their respective phases, and small in the other phase 
(where they should vanish when — )■ oo, but it is not possible to observe this clearly 
because of the small system sizes). The spin structure factor at ^ = :;r/2 is also shown in 
the figure. Interestingly, it also becomes large in the VBS phase, while in the Neel phase 
it should decay to zero with increasing (which is plausible based don this data, but, 
again, not possible to see clearly for these small systems). Based on these results one 
might conclude that the VBS state obtaining at the first-order transition has co-existing 
long-range magnetic order at k = n/2 (i.e., a spiral with period four). Considering the 
small system sizes, this cannot be guaranteed, however. The behavior could also reflect 
a slow power-law decay of the spin correlations. 

Fig. 45 shows the real-space spin and dimer correlations at a = 1 for two g values, at 
either side of the first-order transition. Atg< gc, in the Neel state, the spin correlations 
are staggered and clearly long-ranged (with almost no decay seen as a function of r 
for r > 1). At this coupling there is no structure in D{r), i.e., there is no VBS order. For 
g> gc there is clearly VBS order, with D{r) showing the characteristic staggered pattern. 
Strong period-four spin correlations are also observed, but it is not clear whether these 
are long-ranged or decay slowly to zero with increasing r. The behavior is consistent 
with an \/r decay, but much longer chains would be needed to extract the behavior 
reliably. An interesting point to note here is that the dimer correlations oscillate around 
zero, even though no constant has been subtracted. This is in contrast to the oscillations 
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FIGURE 44. VBS and magnetic order parameters defined in terms of the size-normalized dimer 
structure factor Sd{n) (left) and spin structure factors Ss{7t) (middle) and Ss{7z/2) (right) versus the 
frustration strength g in systems with long-range interaction exponent a = 1.5. Discontinuities develop 
with increasing system size at the first-order transition. 



around a positive value in the J1-J2 chain, as seen in Fig. 36. Looking at the definition 
of the dimer correlation function in Eq. (199), this behavior is another manifestation 
of period-four spin structure, i.e., at odd separation of the bond operators S/ + S,+i, a 
correlation between a ferromagnetic and an antiferromagnetic bond is measured. While 
the nature of the magnetic structure in this VBS is not completely clear, the correlation 
functions seem to indicate a tendency to bond-ordering of the form \ststst- • •), where s 
and t represent nearest-neighbor bonds with high singlet and triplet density, respectively. 
In Fig. 40 the state has been denoted as magnetically quasi-long-range ordered, a VBS- 
QLR0(;r/2) coexistence state, but it could in fact also be a state with coexisting VBS 
and long-range magnetic order. It should also be noted that this phase could in principle 
be very sensitive to boundary conditions on small lattices, and it cannot be excluded that 
the actual magnetic structure is spiral-like, with continuously varying pitch (as in the 
J1-J2 chain with g > 1 [159]). 

The VBS-i-QLR0(;r/2) state should have gapless spin excitations, regardless of the 
spin correlations as long as they are not exponentially decaying. The lowest triplet is at 
k = n/2.lt is, however, difficult to demonstrate the gaplessness based on data for small 
systems, because the size dependence of the gaps (and other quantities) for system sizes 
N = An exhibit large even-odd n oscillations (as well as other irregular size effects). In 
the conventional VBS phase (in the low-right part of the phase diagram in in Fig. 40) 
the lowest triplet is at ^ = n, even when the spin correlations (which are exponentially 
decaying in this phase) are peaked dX k = Tt /2. The level crossing between the lowest 
k = K and k = k/2 triplets can therefore in principle be used to extract the boundary 
between the VBS and VBS-i-QLR0(;r/2) phases. The size dependence of the crossing 
point is not smooth, however, and cannot be extrapolated very reliably. The boundary 
between dominant k = Tt and k = Tt/2 spin correlations has also not been extracted 
accurately. This change in the spin correlations may be associated with a transition to a 
state with periodicity four [158], although there are no sign of the VBS order changing. 

QLRO-Neel transition. Let us return to Fig. 41 for another interesting feature of the 
level spectrum: The lowest singlet excitation for small g has momentum = ;r for a = 4 
but k = for a = I. The switching of the order of these levels as a function of a for 
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FIGURE 45. Spin (left) and dimer (right) correlations in a 32-spin chain at o: = 1 for two values of 
the frustration parameter At g ^ 0.25 and 0.45 the system is in the Neel and VBS-QLRO(;r/2) phase, 
respectively. The first-order transition in this case occurs aXg~ 0.39. 



g < gc is associated with the Neel-QLRO(;r) transition. The level crossings can be used 
to extract this phase boundary very accurately up to ^ ~ 0.25 (while for higher g the 
N ^ oo extrapolations become difficult). While it is not completely clear why the ^ = 
and k = K excited singlets cross each other at this transition, it can again be noted that 
the QLRO(;r) phase the low-energy excitations arise from two deconfined spinous. The 
lowest (for A'^ = An) is a triplet atk = K. The lowest singlet is also atk= n, with a small 
finite-size gap to the triplet due to weak spinon-spinon interactions (and Fig. 33 shows 
and example of how these levels become degenerate as N ^ °°). In the Neel phase the 
spinous are no longer deconfined, and the structure of the low-energy spectrum changes. 

As we will discuss below in Sec. 4.4.2, the 2D Neel state on finite lattices has 
low-energy excitations analogous to quantum-rotor states, which have S = 1,2,3,... 
and become degenerate with the ground state as the system size increases. They have 
momenta k = (0,0) and k = {k, Tt) for even and odd S, respectively. One might expect 
this kind of quantum rotor "tower" in the ID Neel state as well. Apparently, the lowest 
k = n singlet, which would not be part of such a rotor tower, is then pushed up much 
higher in energy, and the k = singlet takes over as the lowest singlet excitation. 

It is useful to compare the level crossing approach with the QMC calculation 
in Ref. [72] for the unfrustrated model with 7i = 1 and 7,>i = A(-l)''" V'""- A 
reparametrization of this model for A = 1, to the convention used in (201), gives 
the curve for ^ < shown in Fig. 40. Finite-size scaling of QMC data for the Neel 
order parameter gave a^- = 2.225 ± 0.025, for up to 4096 [72] . Extrapolating 
the k = 0,7t singlet crossing points (which here have size corrections oc l/N^, with 
/3 ~ 1.50) for N < 32 gives a marginally higher (and probably more reliable) value; 
ttc = 2.262 ±0.001. Analyzing the singlet and triplet gaps at the crossings, assuming 
A ~ N^^, gives the dynamic exponent z = 0.764 ±0.005, in very good agreement with 
Ref. [72]. For the frustrated model (201) on the QLRO(;r)-Neel boundary, the gap scal- 
ing in 1 /N give z ~ 0.75 for ^ up ~ 0.25, while for larger g it is not possible to reliably 
extract ttc and z this way, because of large scaling corrections and the absence of level 
crossings for increasingly large systems as the multi-critical point (g^ ~ 0.41 , a,„ ^1.8) 
is approached (for reference = 2.220 ± 0.005 for ^ = and 2. 170 ± 0.01 for g = 0.2). 
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At the multi-critical point z can be extracted using gaps at the singlet-triplet crossing 
point and the ground state energy maximum, which gives z ~ 0.8. It is, thus, possible 
that the dynamic exponent is constant, z 3/4, on the whole QLRO(;r)-Neel boundary, 
including the multi-critical point where all phase boundaries come together. 

Technical notes. The model we have studied here is rather challenging for a ID 
system, from a technical standpoint, because of the long-range interactions. The combi- 
nation of long-range interactions and frustration makes it difficult to apply other com- 
putational techniques. While efficient QMC techniques can be applied to systems with 
unfrustrated long-range interactions [168, 72], this is no longer possible in the presence 
of the frustrating J2 term, due to the sign problem (discussed in Sec. 5). The DMRG 
method [28, 29], on the other hand, can handle frustration but not easily long-range 
interactions. It would still be interesting to try to apply DMRG, or related techniques 
based on matrix-product states [21], to this system, in particular to study further the spin 
structure in the putative VBS-QLR0(;r/2) state. 

The Lanczos calculations presented above exploited all the symmetries discussed in 
Sec. 4.1. The hamiltonian was stored in compact form on disk, with a separate file for 
each interaction distance r in (201). These files were read and used one-by-one in each 
operation with the hamiltonian according to code {22}. For N = 32, the number of 
Lanczos iterations needed was typically 60 ~ 80 (and less for smaller systems). All basis 
states were saved in primary memory, and re-orthogonalization to all previous states was 
carried out after each iteration (as in code {19}). 



4.4. Two-dimensional systems 

We now discuss exact diagonalization in two dimensions, using the important case of 
the Heisenberg model on the simple periodic square lattice as an example. As in one 
dimension, we can use momentum conservation to block diagonalize the hamiltonian 
within the sectors of fixed magnetization. It is a little more complicated to take maximum 
advantage of lattice symmetries to further split some of the blocks, because of the 
larger number of symmetry operations commuting with the hamiltonian. On the positive 
side, the reward for implementing all the lattice symmetries is that, for high-symmetry 
momenta, where lattice reflections and/or rotations can be used, the blocks are smaller 
for given A'' than in one dimension. In practice, this may be of little help, since the 
maximum linear size L of an L x L lattice that can be diagonalized is still pitifully small 
(L = 6). Exact diagonalization studies are still of great value. 

In one dimension, we used the reflection (parity) operation P and constructed momen- 
tum eigenstates based on representatives \a) and their reflections, (1 -I- pP)\a) (and later 
we added also spin-inversion). Although P does not commute with the translation opera- 
tor, we found that for the two special momenta k = 0,7t, both k and p are still good quan- 
tum numbers. For general k, p is not a valid quantum number, but with semi-momentum 
states, mixing ±k states, we could use P to accomplish a real- valued representation. In 
two dimensions one can in principle also construct a real-valued hamiltonian for any 
momentum, but this is much more complicated in practice than in one dimension, and 
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FIGURE 46. Labeling of sites and bonds on a 4 x 4 periodic square lattice. With the bit representation, 
it is practical to start the site labels from 0, here according to i = x + yL with x,y G {0, ...,L— 1} with 
L = 4. A bond is here labeled according to the "left-down" reference site / to which it is connected; the 
horizontal and vertical bonds are bx — i and by ~ + i, respectively. 



in the end it is not worth the effort (since it does not increase the system sizes that can 
be studied). ^ Here we will therefore work with standard complex momentum states. 

Defining the lattice. As in one dimension, we use the state notation ISq, . . . ,5^ j), 
which is practical with the bit representation of the spins. For a ID system the spin 
indices correspond directly to the chain geometry, but for a 2D lattice we have to 
establish a labeling convention for the sites. We will consider rectangular lattices with 
N = Lj^x Ly sites, at coordinates (x, y) with x = 0, . . . , — 1 and j = 0, . . . , Ly — 1 . A 
natural choice is to number the sites z = 0, . . . , — 1 such that i = xi +yiLx, as illustrated 
for a 4 X 4 system in Fig. 46. In a notation not explicitly dependent on the lattice, we can 
write the Heisenberg hamiltonian as 

Nb 

H = jY,Hb)-^m. (204) 

b=\ 

where [i{b),j{b)] are the two nearest-neighbor sites connected by bond b. Specifying 
the lattice then just amounts to creating this list of site pairs. 

4.4.1. Momentum states in two dimensions 

We again define translations of the spin indices, now in both the x and y directions, as 
illustrated in Fig. 47, with corresponding operators and Ty defined by 

Tyl^Q, . . . ,S^_j) = |5^^_^Q^j, . . . ,Sy^|.^_j^), (205) 



A real maximally blocked basis exists because the hamiltonian is real and symmetric, which guarantees 
real eigenvalues and eigenvectors. If the hamiltonian is diagonalized numerically as a single block, the 
eigenvectors in degenerate multiplets will be mixed, and the translation operators will not be diagonal 
(because their eigenvalues are complex). The degenerate complex momentum states have been mixed in 
such a way as to make the linear combinations real. Such a superposition can in principle be applied to 
block-diagonalize the original hamiltonian matrix into semi-momentum like real-valued blocks. 
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FIGURE 47. Symmetries used for the two-dimensional square lattice. Tx and ZJ, translate the spins by 
one lattice spacing in the positive x and y directions, respectively. The lattice is periodic; the open circles 
represent sites of the opposite edges. Px and Py are reflections with respect to the x- and y-axis, and P^ 
and Pe reflect with respect to the two diagonal axes. For even L (which normally should be used for an 
antiferromagnet), the Px and Py axes pass between lattice sites. The reflections and Pg are about the 
lines connecting the far corners, which go through the sites on the diagonals (and hence leave the spins on 
those sites unchanged upon reflection). 



where the translated spin indices are 

Tx{i) = [xi-l]Lx+yiLx, 

Ty{i)=Xi + {[yi-l]L^)L,, (206) 

with [ji — denoting the modulus of jj — l with respect to Ly, i.e., [— l]Ly = Ly—l. 
Using these translations, a momentum state based on a representative \a) is defined as 

|a(k)) = \a{k,,ky)) = 4r L^'l (207) 

where the possible momenta are 

ky=—my, my = 0, 1, . . . ,Ly— 1, J ^ {x^j} . (208) 

The normalization constant Na depends on the translational properties of the representa- 
tive, i.e., the number of different states Da obtained among the group of x Ly transla- 
tions of the representative \a) (e.g., for a state with the spins in a checker-board pattern 
Da = 2). A representative is incompatible with the momentum if the sum of phases Fa in 
(207) over the translations bringing \a) onto itself vanishes. For a compatible state, the 
normalization constant A^^ = Da\Fa\'^. The easiest way to compute this in a program is 
simply to carry out all the translations and sum up Da and Fa in the process, instead of 
using explicit formulas as we did for ID systems (where one can of course also use the 
more brute-force approach). 

The construction of the hamiltonian matrix proceeds as in the ID case. We again split 
the hamiltonian into a diagonal piece Hq and off-diagonal bond operators //^ as in (121) 
and (122), where now j = I,. ..Nt,. Acting with these operators on the representative 
I a), we again may have to translate the resulting state in order to obtain the new 
representative \bj) corresponding to Hj\a), i.e., 

Hj\a)=hj{a)Ty^'^T^^^bj). (209) 
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Acting on a momentum state, we thus get 

HM%,ky)) = ^t'i:^'^'''^'^Ti'~'^^Tt'^\bj) (210) 

^ hj{a) i(k^l'^.+kyl])^y y -i{k^x+kyy)jyjx\r^\ 
fM~ I— I y X \ jh 

and taking the overlap with {bj{kxi ky) \ gives the matrix element 



{bikx,ky)\Hj\a{kx,ky))^hj{a)c-'^''^'^^^^^^^ (211) 

The normalization constants (or some integer mapping to their possible numerical val- 
ues) should be stored along with the representatives. 

Incorporating other lattice symmetries. We here consider a quadratic, L x L, system. 
There are eight independent rotations and reflections of such a lattice, which can be taken 
as, e.g., the four 90° rotations and a reflection about a horizontal or diagonal axis. It can 
be easily verified that additional reflections can be written as combinations of rotations 
and the first reflection. The easiest way to see this is to just consider all rotations and 
reflections of a 2 x 2 array; 

4 3\/3 2W2 \ \(\ 4 
1 2)14 ijls 4)12 3 



3 4W2 3W1 2W4 1 
2 ijll 4JI4 3JI3 2 



(212) 



Here the first row contains all the rotations of the first array, and in the second row those 
arrays have been reflected by exchanging the two columns. Any other reflection will just 
produce an array which is already in the above set of eight. Other permutations of the 
elements do not correspond to a combination of rotations and reflections (and hence do 
not correspond to symmetries of hamiltonians we are normally interested in). In practice, 
we can use any convenient set of reflections, or rotations and reflections, with which the 
eight unique transformations can be generated. 

Using translations as well as a set of other symmetry operators, which we for now 
collectively add together into a single operator Q, with corresponding quantum numbers 
{q\, a momentum state can be defined as 

l«"(k,M)> = I I e-(^^-+^^^)r;>r;^(2|a). (213) 

As we have seen above, there are more symmetry operations than unique transforma- 
tions. We can select the ones that are most convenient for given momentum. Instead of 
using the rotations of the square lattice, we will here use the reflections Px and Py about 
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the X and y-axis, respectively. These correspond to symmetries of the hamiltonian for 
any rectangular (L^ x Ly) lattice (and note that the 180° rotation is equivalent to PxPy). 
For generic momenta, the reflections do not commute with the translations, and there 
can be no further blocking using Px and Py, but there are high-symmetry momenta for 
which Px or Py, or both, can be used. For kx = ±ky we will instead use the reflections 
Pd and Pg about the two perpendicular diagonal axes. All the reflections are defined in 
Fig. 47. For the most special momenta, k = (0,0) and (;r, n), and Lx^Ly arbitrary, we 
can also use Pj (or Pe) in addition to Px, Py. 

To achieve maximal block-diagonalization for a given momentum {kx,ky), the opera- 
tor Q in (213) can be chosen according to: 

1, general k (a) 

i^+P.P.). k={0,ky),{n,ky) (b) 

i^+PyPy). k^{kx,0),{kx,K) (c) 

Q=< i^+PdPd), kx = +ky{Lx=Ly) (d) (214) 

[l+PePe), kx = -ky{Lx = L,) (e) 

( 1 + PyPy) ( 1 + PxPx) , k = (0, 0) , (TT, TT) , ^ (/) 

{l+PdPd){l+PyPy){l+PxPx), k^{0,0),{n,7t),px^py {g) 

where again all the reflection quantum numbers take the values ± 1 . Note the restriction 
Px — Py in case (g), i.e., there are no such states for px ^ Py. In that case, we can of 
course also use option (f), but using (g) splits the blocks further into two. 

To prove that the symmetry operators Q defined above indeed do have the correspond- 
ing conserved quantum numbers within the different momentum sectors, the following 
relationships between the lattice transformations are useful: 



(215) 



The remaining pairs are commuting operators; ?>■] = 0, [Tx.Py] = 0, [Ty.Px] = 0, 
= 0, [/'e,/'^] = 0. Let us just check the most complicated case, (g) in (214). The 
permutations commute with the translations for these special momenta, for the same 
reasons as discussed for ID systems in Sec. 4.1.4. We then only have to investigate the 
properties of Q. It is clear that the state is an eigenstate of Pj, that being the left-most 
reflection operator. To check whether it is also an eigenstate of Px and Py, we use some of 
the relationships in (215) to verify that PxQ = pxQ and PyQ = pyQ. After a little algebra 
we get 

Px{l+ PdPd) ( 1 + PyPy) ( 1 + PxPx) = 

Px{^+ PxPyPdPd)i^ + PyPy) ( 1 + PxPx) , (216) 

and an analogous result for Py. Thus, if px = Py, the state is indeed, for the special 
momenta (0, 0) and {n, n), an eigenstate also of Px, Py, and P^. 



Px^x — 


Tx-'Px 


PyTy = 


f-lp 

iy ry 


PdTx = 


TyPd 


PdTy = 


TxPd 


PeTx = 


Ty'Pe 


PeTy = 


Tx'Pe 


PdPx^ 


PyPd 


PdPy^ 


PxPd 


PgPx — 


PyPe 


PePy = 


PxPe- 
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If we want to study all momentum sectors, using the reflection symmetries does not 
buy us much, because in most of the blocks we have to use the generic option Q— I in 
(214)(a). These symmetries are useful when investigating the ground state, which often 
is in the maximally blockable sector (g). The most useful aspect of the reflections may 
still be just the fact that they give insights into the symmetry-aspects of the states (e.g., 
to classify excitations). Apart from the lattice symmetries, spin-inversion symmetry can 
be used in the = sector, exactly as in the ID case. 

We have considered combinations of reflection symmetries but could instead have 
used a combination of rotations and reflections. This would give us different quantum 
numbers, but the same size of the blocks. In group theory, there are irreducible represen- 
tations, with standardized names, corresponding to various lattice symmetries. Instead 
of working out the applicable symmetries for a new case, one can look up the irreducible 
representations and their corresponding character tables and employ these to construct 
states with quantum numbers of a chosen irreducible representation. This is very use- 
ful when studying more complicated lattices. Ref. [143] has a discussion of this more 
formal approach, with examples for chains and square lattices. 

We have here used symmetries applicable only to x Ly lattices with the edges along 
the X- and y axis on the square lattice. With Lx=Ly=L, which is the most interesting if 
we are interested in approaching the infinite 2D lattices, we are then limited to L = 4 and 
L = 6 (since odd L introduces undesirable frustration effects for an antiferromagnet). To 
increase the set of accessible lattice sizes A'^, one can also consider "tilted" lattices, with 
edges that are not parallel to the square-lattice axes [143]. 

Implementation. To study a 6 x 6 lattice we have to use long (8-byte) integers for the 
bit representation. To save time, it is ten better not to loop over all the 2^ possible state- 
integers and single out the ones corresponding to a given magnetization (as is done, 
e.g., in code {7}), but to use a scheme which from the outset only constructs states 
with a given (i.e., given number n^ of t spins). This can of course also be done for 
ID systems. Such a more sophisticated generation of specific states starts with the 
integer a = 2"t+i — 1, in which the first bits are 1 and the rest are 0, and then execute: 

do 

call checkstate (a, pass) {26} 
if (pass) then M = M + 1 ; store representative information endif 

c = 

dob=l,N 

ifia[b-l] = l)then 
i{(a[b] - l)then 

c = c+l 
else 

a[0],...,a[c— 1] = 1; a[c],. . . ,a[b — I] — 0; a[b] — l; exit 
endif 
endif 
enddo 

if (b = A^) exit 
enddo 
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TABLE 3. Sizes M(L) of the yfc = (0,0) state blocks for L x L lattices 
(L = 4,6) with magnetization = and different reflection and spin- 
inversion quantum numbers. For p^^ = —py there is no quantum number pj. 



For Pjc = 


-^,Py^ 


1 the block structure is the same as for px 


= hPy=-l. 


Px 


Pv 


Pd z 




Mil = 6) 


+1 


+ 1 


+1 +1 


107 


ij, 804,936 


+1 


+ 1 


+ 1 -1 


46 


15,761,166 


+1 


+ 1 


-1 +1 


92 


15,796,390 


+1 


+ 1 




38 


15,752,772 


-1 




+ 1 +1 


50 


15,749,947 


-1 




+ 1 -1 


45 


15,739,069 


-1 




-1 +1 


42 


15,741,544 


-1 






36 


15,730,582 


+1 




+1 


75 


31,481,894 


+1 






108 


31,525,574 



Here the inner loop is simply searching for the lowest bit position — 1, for which a set 
(1) bit of a can be moved one step to the left (i.e., to a position b where the bit currently 
is 0). After such a position has been found, all the previously set bits below this position 
(the number of which is kept track of with the counter by c) are moved to the lowest 
positions (0, ...,c^ 1). This bit evolution corresponds exactly to how the digits of a 
base-2 odometer advance from right to left. The contents of checkstate depend on what 
symmetries are used, but it would be very similar to the implementations we discussed 
for the ID case, apart from the fact that the translations [defined in (205)] are more 
complicated than just cyclic bit permutations and have to be implemented by hand. 

The normalization constant, needed when constructing the hamiltonian matrix ele- 
ments, should be delivered by checkstate (if a representative has passed the tests). The 
simplest way to compute the normalization (instead of using formal expressions as we 
did in the ID case) is again just to carry out all the symmetry operations of the represen- 
tative state and add up the sum Fa of factors (the complex momentum phases as well as 
the plus or minus signs from the reflection quantum numbers) in (213) for each symme- 
try operation bringing the representative onto itself. That number, along with the number 
Da of non-equivalent transformations of \a), gives the normalization A^^ = Da\Fa\'^ (and 
again Na = if the representative is incompatible with the quantum numbers). We do 
not discuss further details of how to implement the basis generation and the construc- 
tion of the hamiltonian matrix, as these tasks are straight-forward generalizations of 
the one-dimensional implementations discussed in Sees. 4.1.3 and 4.1.4. Spin-inversion 
symmetry in the — sector can be implemented as discussed in Sec. 4.1.5. 

Example of block sizes. Table 3 lists block sizes for the square-lattice systems of 
interest in Lanczos calculations, for one of the special momenta, k = (0,0), where the 
largest number of symmetries can be exploited. The ground state of the Heisenberg 
model is in this block (in the sub-block with all other quantum numbers equal to 1). For 
L = 6 even the smallest blocks have more than 15 million states, and the largest blocks 
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(for general k) are about eight times larger. Blocks of this size can still be handled in 
Lanczos calculations (the smaller one rather easily on a standard workstation). 

To test the basis construction, it is useful to check the sum of the sub-block sizes, 
which should equal the total number of states in the block. For L = 4 there are 822 states 
with k = (0,0), which equals the sum of all the reflection-block sizes in Table 3. 



4.4.2. The Neel state and its quantum rotor excitations 

We now illustrate 2D Lanczos calculations with results for the Heisenberg model. Al- 
though the small lattices accessible with this method are not sufficient for quantitatively 
accurate extrapolations to the thermodynamic limit, the calculations do illustrate some 
important aspects of systems with Neel order (beyond what we discussed in the frame- 
work of spin-wave theory in Sec. 2.1). We introduce the quantum rotor mapping of the 
low-energy states of finite systems, and based on these discuss the magnetic susceptibil- 
ity. We also calculate the sublattice magnetization. 

Two-spin model of quantum-rotor states. If there is antiferromagnetic order, the 
spins on sublattice A are predominantly oriented in the same direction, and the ones on 
sublattice B are predominantly in the direction opposite to those on A. If the number 
of spins N is finite and the symmetry is not broken, the over-all direction, defined, 
e.g., by the sublattice A spins, is not fixed, however. This situation can be captured 
by considering the sum of the spins on the individual sublattices A and B [169], 

Sa = 1^S,-, S5 = £S;, (217) 

ieA ieB 

as two fixed-length spins Sa — Sb — N/2 (more precisely we would write, Sa — Sb = 
rusN for large A^^, but the constant is irrelevant), as illustrated in Fig. 48. The two large 
spins are assumed to be antiferromagnetically coupled to each other in the simplest 
possible rotationally invariant way, which is through an effective Heisenberg interaction; 

Hab = JabSa • Ss = 5 (S^ — — S|) , (218) 

where S = Sa -I- Sb is the total spin. Here S^ and S| are just constants proportional to A^^, 
which can be neglected when we discuss excitation energies. However, these constants 
imply hat the coupling constant Jab should be l/N,m order for the total ground state 
energy to be oc N. The ground state of (218) has total spin 5 = and excitations with 
5 = 1 , 2, ... at energies JabS{S + 1 ) /2 above the ground state. 

We define a new A^-independent coupling Jeff = NJab/2 and write the energies as 

^s = jJ-^^- (219) 

These excitations are referred to as the tower of quantum rotor states. The states with 
S <S become degenerate as — > oo, and combinations of them can then be formed 
which are ground states with fixed direction of the Neel vector (in analogy with the infi- 
nite number of momentum states required to localize a particle in quantum mechanics). 



Computational Studies of Quantum Spin Systems 



January 18, 2011 



120 




Sa 



Sb 



FIGURE 48. Effective description of the rotationally invariant Neel vector in.s in terms of two large 
spins, Sa, Sg, corresponding to the sum of the spins on the two sublattices. There is an effective antiferro- 
magnetic coupling between these spins, leading to a singlet ground state and a "tower" of quantum rotor 
excitations of total spin S ^ 1,2,... at energies A5 ^ S{S +l)/N above the ground state. 



thus allowing for the symmetry breaking that is the starting point for spin- wave theory. In 
the thermodynamic limit, the direction of the ordering vector is fixed (as the time scale 
associated with its rotations diverges [169]), and the quantum rotor-states are then in 
practice not accessed. They are neglected in standard spin-wave calculations (discussed 
in Sec. 2.1) from the outset because the order is by construction locked to the z direction. 
One can still access the rotor energies in spin-wave theory, by considering systems in 
an external magnetic field, tuned to give a ground state with total magnetization 5~ = 5 
[170, 171]. The rotor states are of great significance in finite clusters. 

The effective coupling 7eff in (219) for a given system can be determined if we can 
relate it to some physical quantity which depends on the rotor excitations. An obvious 
choice is the uniform magnetic susceptibility, x = d{m^) /dh. Calculating it for the two- 
spin model when T — )■ gives X = ^ /Jeff- For the real Heisenberg model on a finite 
cluster in dimensions d >2, x should be dominated by the quantum rotor states when 
T <^ 1/L, because the lowest spin wave energy scales as oc l/L (while the quantum 
rotor states scale as l/U^)- Thus, we can write the effective quantum rotor tower for a 
Heisenberg model with Neel ground state as 

3XN 

where x should be evaluated in the limit N (first) and T — )■ 0. Note that / = (3 /2)Nx 
here plays the role of a moment of inertia, giving an analogy between (220) and the 
energy spectrum of a rigid rotor in quantum mechanics. 

The relation (220) can also be used as a way to compute the susceptibility of a 
Heisenberg models numerically; by extracting the lowest energies as a function of S (for 
small S, where the quantum-rotor mapping should apply). More precisely, the small-5 
energies gives an estimate for x as the — )■ 00, 5 — )■ limit of the quantity x{S,N): 

1 _Ms(Es~E„) p^jj 



X(S,N) S(S+i) 

Here Es denotes the lowest energy for total spin S. Note that we have to subtract the 
ground state energy (5 = 0) because in the two-spin effective model we only computed 
the excitation energies A5 with respect to the ground state energy (and the latter is not 
given accurately by the two-spin model). One would expect an 5-independent behavior 
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FIGURE 49. Lanczos results for the Heisenberg model on 4 x 4 and 6x6 lattices, (a) The energy 
relative to the ground state of the lowest state for each total spin sector, normalized by NS{S+ 1), so 
that perfect quantum rotor excitations should produce S- and A^-independent values, (b) Spin correlation 
versus distance for all possible distances r on the periodic lattices. The known r ^ °o value (from the 
QMC results in Fig. 5) is indicated by the dashed line. 



only for 5 ^ L, as the higher rotor states should be influenced by effects not taken into 
account in the two-spin model. 

In analogy with the low-energy ID quantum numbers discussed in Sees. 4.1.6 and 
4.3.1, on the 2D square lattice (where N = 4n for all even L) the quantum rotor states 
correspond to momentum (tt, ;r) and (0,0) for odd and even S, respectively. For even 
S, the lowest states have reflection quantum numbers P;,: = = = 1 in (214), while 
for odd S the appropriate quantum numbers are = Py = —I, Pd = ^- The lowest state 
for given S can be obtained in the magnetization sector = S (and since 7^ we 
cannot use the spin-inversion symmetry here). The (5 = 0) ground state is in the fully 
symmetric sector; the momentum is (0, 0) and Px = Py = Pd = z = I. 

Lanczos results for L = 4 and L = 6 are shown in Fig. 49(a). There are clearly 
large corrections to (220), as there is a significant decrease in x^^{S,N) with S and an 
increase with A^. For fixed A^, the difference between 5=1 and S = N/2is roughly 10%. 
Considering the fact that the limit S <^ L cannot really be studied based on the small 
lattices, deviations of this order are not surprising. The rotor states have been studied 
using QMC calculations for much larger lattices [171, 33]. The most precise calculation 
for smaU S and large gives x^HS,N) 22.8 [33]. 

To understand the deviations from the rotor picture, one can use the analogy of a 
slightly non-rigid quantum rotor, which seems natural considering that the two- spin 
model is defined with fixed-length spins, while clearly in the real system the sublattice 
spins fluctuate (in a way which can depend on the total spin S). It may be possible to take 
these effects into account by adding higher-order terms (S^ ■ Sg)^, etc., in the interaction 
(218). Details of this extended two-spin picture have not been worked out, however. 

Transverse susceptibility. For an infinite system at T = the spin-rotational symme- 
try is broken. One can then consider transverse and longitudinal components (with re- 
spect to the Neel vector), x± and X\\^of the susceptibility. In the two-spin model it is clear 
that X\\ = and this is also true in the Heisenberg model. In a large system in which the 
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symmetry is not broken, the components are rotationally averaged, giving % = {2/3)x±- 
The transverse susceptibility is relevant in practice in real Neel order quasi-2D systems 
(such as the high-Tc cuprates) at T > 0, because of anisotropics and/or 3D couplings due 
to which the system can have a finite ordering temperature. The energy scales associ- 
ated with the ordering are small, however, and the T — > value of x± = (3/2);^ of an 
isolated 2D plane (modeled by the Heisenberg antiferromagnet) gives a good estimate 
of this quantity in the more complicated system. 

Sublattice magnetization. The first attempt to compute the sublattice magnetization 
of the 2D Heisenberg model was made using Lanczos calculations by Oitmaa and Betts 
in 1978 [172]. At that time, the 6 x 6 lattice was beyond reach, and extrapolations based 
on smaller lattices resulted in a value that is too large. Fig. 49 shows the spin correlation 
function at all possible separations r on 4 x 4 and 6x6 lattices. There is a slow decay 
with r, and it at least appears plausible that the results are approaching a value (m^) > 
as r — )^ oo (and N ^0°). QMC calculations, results of which were already shown in Fig. 5, 
give the r — )■ 00 result indicated with a dashed line in Fig. 49. The existence of long-range 
order in the 2D Heisenberg model was, in fact, under debate until the first reliable QMC 
simulations were done in the late 1980s [173]. 

One can attempt to extrapolate Lanczos results XoN for, e.g., frustrated systems, 
were other calculations are very challenging [98, 99]. The results must in general be 
viewed with caution, however, because it is assumed that the behavior for very small 
lattices are already exhibiting the ultimate asymptotic behavior. Fitting procedures based 
on "Betts clusters" [174] of different shapes may then be misleading, even when the size 
dependence is smooth. There are examples where cross-overs occurs at larger distances 
[82], due to the presence of some finite length-scale in the system, which has to be 
exceeded by L before asymptotic behavior can be observed. 

5. QUANTUM MONTE CARLO SIMULATIONS AND THE 
STOCHASTIC SERIES EXPANSION METHOD 

Feynman's path integral formulation of quantum statistical mechanics [175] has played a 
major role in the development of QMC methods. In the case of spin systems and related 
lattice models, methods based on the path integral in imaginary time are commonly 
referred to as world line methods [176]. These techniques were originally based on 
an approximate discretization of imaginary time — the Suzuki-Trotter decomposition 
of the Boltzmann operator exp(— /3//) [177, 178]. Later, exact algorithms operating 
directly in the imaginary time continuum were developed [179, 32]. The first practically 
useful QMC method did not, however, use the path integral. Already in the early 1960s, 
Handscomb developed an approximation-free method for the Heisenberg ferromagnet 
based on the power-series expansion of exp(— /3//) and exactly computable traces of 
products of permutation operators (in terms of which the Heisenberg exchange can 
be written) [180, 181]. Although this scheme was also generalized to the Heisenberg 
antiferromagnet [182, 183, 184] and some other systems [185], world line methods 
were in general more efficient and dominated the field for a long time. The power-series 
approach to QMC calculations was revived with the introduction of a more generally 
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applicable exact stochastic series expansion (SSE) formulation [186, 187], in which 
also the traces are sampled (thus circumventing the previous reliance on permutation 
operator algebra). In addition to improved computational utility and efficiency, this 
generalization of Handscomb's approach also shows clearly how closely the discrete 
power series is related to the path integral in continuous imaginary time. Modem QMC 
algorithms based on the two formulations are also in the end rather similar [33, 31]. In 
particular, in both cases the spin configurations for some models (such as the Heisenberg 
model) can be sampled using highly efficient loop-cluster updates [188, 189, 179, 190], 
which are generalizations [191] of the classical Swendsen-Wang [118] cluster algorithm 
to quantum systems. Further generalizations of the loop concept to "worms" [32] and 
"directed loops" [190, 33] have an even wider applicability (e.g., in the presence of 
external fields), and have enabled large-scale studies of a broad range of quantum spin 
and boson models. Some of these can now be studied at a level of detail approaching the 
state of the art for classical systems. 

In the case of frustrated spin systems, QMC methods are in general hampered by 
"sign problems", i.e., a non-positive definite path integral or series expansion [192]. 
While calculations can still be carried out in principle, and there has been some progress 
in controlling the sign fluctuations at high temperatures [193], the statistical errors due to 
the mixed signs become uncontrollable at low temperatures. Apart from discussing the 
origins of the sign problem for frustrated models, we will here only consider unfrustrated 
(bipartite) systems. Despite constituting just a subset of the systems which we are in 
principle interested in, such models still exhibit a wealth of interesting physics and 
continue to provide important insights in cutting-edge research. 

In these notes we will discuss implementations and applications only of the SSE 
approach, which for spin systems is normally more efficient and technically simpler 
than the continuous-time path integral. It is still useful to understand the relationships 
between the two schemes. In Sec. 5.1 we therefore first review path integrals in quantum 
statistical mechanics and world line QMC methods, in both discretized and continuous 
imaginary time. We introduce the general series expansion formulation of quantum 
statistical mechanics in Sec. 5.1.3 and investigate its mathematical relationship to the 
path integral. An efficient implementation of the SSE method for the 5 = 1 /2 Heisenberg 
model is described in Sec. 5.2. Illustrative calculations and results for several ID and 
2D systems are discussed in 5.3; single chains in 5.3.1, ladder systems in 5.3.2, the 
uniform 2D lattice in 5.3.3, dimerized systems in 5.3.4 (focusing on the quantum phase 
transition between the Neel state and a non-magnetic state), and J-Q models in 5.3.5 
(with examples of both continuous and first-order Neel-VBS transitions). 



5.1. Path integral and series expansion formulations 
of quantum statistical mechanics 

The main technical problem in quantum statistical mechanics is how to deal with the 
Boltzmann operator exp(— j8//). As we saw with exact diagonalization methods in the 
previous section, a direct construction of the corresponding matrix becomes infeasible 
for systems with more than a few tens of spins. The path integral method offers a way 
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to transform the trace of this operator (the partition function) into a form that can be 
sampled using Monte Carlo methods. An alternative is to start from a power-series 
expansion of the exponential. Here we introduce both these approaches as a foundation 
for implementing the SSE method for Heisenberg models in Sec. 5.2. 



The starting point of the path integral formulation is to write the exponential operator 
at inverse temperature j8 as a product of L operators with = j8/L in the exponent; 



The trace can be expressed as a sum of diagonal matrix elements in any basis. We 
can also insert a complete set of states between each of the exponentials. The partition 
function then takes the form of an L-dimensional sum of products of matrix elements; 



^^EE--- E (ao|e-^^^|aL-i)---(a2|e-^^^|ai)(ai|e-^^^|ao). (223) 



Formally, the exponential operator is equivalent to the Schrodinger time evolution oper- 
ator exp (—///?) (with h= 1) at imaginary (Euclidean) time t = — /A^. One can therefore 
consider (223) as a mapping of a ^^-dimensional quantum system to an equivalent sys- 
tem in J + 1 dimensions, where the new dimension is imaginary time.^ The state index 
/ corresponds to a discrete set of imaginary time points T/ = /A^, with < T/ < j8 and 
periodic time-boundary conditions («/, = OCq) for the states. Often the discrete times are 
referred to as "time slices". 

From a technical perspective, the purpose of writing Z in the form (223) is that, 
while the matrix elements of exp(— j8//) are difficult to evaluate, we can use some 
approximation to evaluate the matrix elements of exp(— A^//) when A^ is small (the 
number of time slices is large) . We can then compute the weights for the different time- 
periodic "paths" 0£o — > oti • • • ^ ocl-i Oq over which the system can evolve in the 
chosen basis. In QMC calculations, these paths are importance-sampled according to 
their weights in (223). 



° Often the equivalent (c/+ l)-dimensional system is referred to as an equivalent classical system. This 
terminology is, however, appropriate only in cases where all the path weights are positive, which we have 
not yet ascertained. It will be true for some classes of systems only, and those are the ones for which 
QMC calculations can be performed in practice. In some cases, e.g, the transverse-field Ising model, the 
effective model is an anisotropic version of the classical model in c/ + 1 dimensions. In most cases, the 
equivalent classical system is not, however, the same kind of model as the original one with just one more 
dimension — typically the path integral corresponds to some completely different statistical mechanics 
problem md + \ dimensions, with no apparent resemblance to the original <i-dimensional system. In some 
cases, one can, however, show that the quantum system is equivalent, on large length and time scales, to 
the same classical system in + 1 dimensions, e.g., the 2D quantum Heisenberg anliferromagnet at low 
temperatures has the same properties as a 3D classical Heisenberg model |5|. This mapping is normally 
carried out using a basis of coherent spin states, as discussed, e.g., in the book by Auerbach [4]. 



5.7.7. The imaginary -time path integral 




{111) 
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To discuss the possible space-time paths and their weights, let us first consider a 
seemingly extreme method of approximating the exponential operator by just its Taylor 
expansions to linear order in Af//, i.e., 

Zw Y,i^\'^-^^^\^L-i)---{oc2\l-\H\ai){ai\l-A^H\ao), (224) 
{«} 

where {a} refers collectively to all the states |ao), • • • , \ ocl-i)- Since each factor now 
has an error of order and there are L = /3 /A^ factors, the relative error in Z at 
fixed j8 is of the order A^. The leading neglected term in each expanded factor also 
contains H^, and we might therefore suspect that the discretization error should also 
scale as A^^. However, to find exactly how the error scales with j8 and A'^ requires a more 
careful analysis than just naively counting the neglected terms at the level of {H^) oc 
and L = j8/A^, because the paths contributing to the error are not the same as those 
contributing to the approximate Z written as (224). Based on arguments discussed later, 
to achieve an error which is independent of /3 and A' we should expect to use at least of 
the order of time slices in the linear approximation (224), i.e., A^ ex 1 /N. 

Far better approximations of the time slice operator can be used to reduce the number 
of slices (to independent of A'^ and proportional to /3, with an error of A^ or smaller). 
However, for the purpose of illustrating many basic aspects of the path integral and 
QMC methods based on them, it is convenient to first use the linear approximation. 
In fact, as we will see shortly, it is even possible to take the limit A^ — )■ within this 
approximation and formulate QMC algorithms based on the exact continuous time path 
integral [179, 32, 33]. We therefore discuss the linear version in some more detail, 
although in practice it is not recommended to implement an actual program using 
this scheme with A-j- > 0. After understanding the properties of the paths in the linear 
approximation it will be easy to understand how to take the limit A^ — )■ or use a higher- 
order discrete approximation, such as the Suzuki-Trotter decomposition discussed in 
Sec. 5.1.2. 

Boson path integral and world lines. At this stage it is better to continue the dis- 
cussion with a particular hamiltonian in mind, in order to have a concrete example of 
the paths and how one might go about sampling them. Consider first a boson system 
with only kinetic energy. We here work on the lattice,^ and the purely kinetic-energy 
hamiltonian can be written as (with an unimportant prefactor set to 1); 

H = K = -Y,Kij^-Y^ {a+at + a+a,) , (225) 
ihj) {hi) 

where af and at are, respectively, boson creation and destruction operators on the 
sites i — l,...,N, and (/,_/) is a pair of nearest-neighbor sites on an arbitrary lattice 
(in any number of dimensions, although for ease of visualization we will explicitly 
consider a ID chain). For simplicity, we will consider hard-core bosons, for which the 



' See [194] for a review of boson path integrals in continuous space, and [195] for more recent progress 
on efficient QMC algorithms based on them. 
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FIGURE 50. Graphical representation of terms in the discrete path integral for a system of three bosons 
on a chain with 14 sites. There are 20 time slices, labeled by / = 0, . . . , 19, which correspond to the states 
!«/) in Eq. (224). The bosons are represented by circles. They are connected to show more clearly the 
formation of world lines on the space-time lattice. Note the periodic boundary conditions in the time 
(vertical) direction, i.e., the first and last {I — 0) states are the same. Note also that with the linear time 
slice operator (1 — Ai/), there cannot be more than one particle hop (diagonal line segment) within a time 
slice. With periodic spatial boundary conditions, the identical particles can undergo cyclic permutations 
(winding), as in the configuration to the right. 



site occupation numbers are restricted to n,- = 0, 1 (which can be thought of as arising 
from a very strong on-site repulsion). Hard-core bosons also have a simple relationship 
to S = 1/2 quantum spins. Occupied and empty sites correspond directly to t and \, 
spins, and with no interactions the boson hamiltonian is equivalent to the XY model, 
with Hij = — (5j^5j +S'^S^). There is no chemical potential in (225) — we use the 
canonical ensemble with fixed number of bosons Nb (density p = Nb/N or, in the 
spin language, magnetization m = p — 1/2). Later, we will see that it is easy to also 
incorporate interactions. 

We work in the basis of site occupation numbers n,, / = 1,...,A^. Consider a ma- 
trix element (a/_|_i|l + A^^/jla/), which appears in Eq. (224) when we also write each 
instance of // as a sum over all the individual hopping terms Kij (and note the cancel- 
lation of the minus signs). There are then two possible relationships between the states 
\ai) and resulting in non-vanishing matrix elements; either |ci;/+i) — \oci+\) or 

|oc/+i) = Kij\ai). In the latter case, there must initially be a particle at site /, which is 
moved by Kij to a previously empty site j, or vice versa. Since these conditions must 
hold for all consecutive matrix elements, / = 0, . . . ,L — 1,0, each particle has to fol- 
low a "world line" (a term borrowed from the similar concept of a space-time path in 
relativity theory), which at each step of the imaginary time propagation either stays at 
the same spatial position or moves (jumps) by one lattice spacing. Two such world line 
configurations for a ID system are illustrated in Fig. 50. 

Note again the periodic boundary conditions in the imaginary time direction, which 
follow from the trace over ocq in (224). These boundary conditions apply to the boson 
states (occupation numbers), but not strictly to the world lines. Since the bosons are 
identical particles, they can be permuted in the course of their evolution from T = to 
T = /3 and still fulfill the required time-periodicity. The only possible permutation in a 
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ID hard-core system with nearest- neighbor hopping is a cyclic permutation involving 
all the particles on a periodic chain, resulting in a net particle current around the 
ring. An example of such a "winding" configuration is shown to the right in Fig. 50. 
Other permutations can take place with soft-core bosons (i.e., with no restriction on the 
occupation numbers rii), and also hard core bosons in two or three dimensions (and even 
in one dimension if hopping beyond nearest neighbors is included), but only winding 
results in a net current. The net number of times world lines wrap around the system 
(the total current divided by the system length) is called the winding number. It is a 
topological quantity, characterizing a non-local property of the configuration. In higher 
dimensions, there can be winding in each of the spatial directions, with corresponding 
winding numbers. An interesting aspect of winding in bosons systems is that it (a 
properly normalized variance of the winding number) corresponds to superfluidity [175, 
196]. In spin systems, winding is related to the spin stiffness, or helicity modulus, as we 
will discuss further below. 

We now denote by H'dci;}) the weight of a proper world line configuration {a} (i.e., 
one satisfying the constraints discussed above) in the partition function; 

Z=£w^({a}). (226) 
{«} 

The weight has a very simple form in the case considered here: Each kinetic operation 
Kij is multiplied by the time step A-;: in (224), and the matrix elements of the boson 
operators is always 1 in the hard-core case. The path weight is therefore 

W({a}) = Af, (227) 

where nK is the total number of kinetic jumps, corresponding to diagonal world line 
segments in Fig. 50. This weight can be simply generalized to the case of soft-core 
bosons, where the creation and annihilation operators are associated with factors ^Jni 
and \/ni — 1. Below we will also consider interactions between the bosons, which lead 
to a more complicated factor multiplying the kinetic contribution (227). 

In a QMC simulation, one makes changes in the world lines in such a way that 
detailed balance is satisfied with the configurations distributed according to W({a}). 
Local updates of world lines are illustrated in Fig. 51. Note that while these local 
moves are ergodic within a sector of fixed winding number, they cannot change the 
topological winding number. In higher dimensions, local moves of individual world lines 
also cannot lead to any of the other permutations that need to be included. Local updates 
involving two world lines can be used to sample permutations, but are not enough to 
change the winding numbers associated with periodic boundaries. Algorithms based 
on simple local updates were historically successful in studies of some systems, but 
more sophisticated and powerful loop updates [31], and related updating methods in an 
extended configuration space [32, 33], have been developed more recently. These are 
much more efficient in evolving the world lines in an ergodic way (instead of "getting 
stuck" or spending long times in some restricted part of the contributing configuration 
space) and can also lead to winding number changes. 

In this section we are primarily concerned with the path integral representation, 
and we we will not yet address the practical implementation of world line sampling 
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FIGURE 51. Local updates of world lines, which change the number of kinetic jumps (diagonal world 
line segments) by ±2. In (a) the diagonal segments are directly following each other, whereas in (b) they 
are separated by one time slice. One can consider moves with arbitrary separation of the jumps. 



(which we will do in detail only for the rather similar configurations arising in the 
series expansion formulation, although we will later in this section return to some more 
quantitative discussion of world line sampling as well). Next, we consider estimators for 
expectation values that we may want to compute. This will also give us some deeper 
insights into the properties of the world lines themselves. 

Expectation values and their world line estimators. Consider an arbitrary operator 
O and its thermal expectation value; 

(O) = lTr{Oe"^^}. (228) 

Proceeding with the numerator as we did for Z in Eq. (223), the exact time-sliced form 
of the expectation value can be written as 

(^) = ^ E (ao|e-^^^|aL-i) • ■ • (a2|e-^^^|ai)(ai|e-^^^0|ao). (229) 
{«} 

We would like to express this expectation value in the form appropriate for Monte Carlo 
importance sampling; 

^ ' E«,)W({a}) ■ ^ ' 

This is not always possible, however, because the paths contributing to the numerator in 
(229) may be different from those contributing to Z. If V7({a}) = for a configuration 
contributing to the expectation value, then no estimator 0{{a}) can be defined in (230). 
One then has to proceed with the calculation in a different way. We begin by discussing 
classes of expectation values for which (230) does apply. 

Diagonal operators. For quantities that are diagonal in the occupation numbers, e.g., 
a density correlator {ninj), the form (230) is trivially valid, because 0\(Xq) = 0{(Xo)\(Xo) 
in (229), where 0{(Xo) denotes an eigenvalue of O. The estimator can then be taken 
as 0{{a}) = 0{(Xo). This form of the estimator for a diagonal operator remains valid 
regardless of how the time-slice evolution operator e^^^^ is approximated. Because of 
the cyclic property of the trace, by which the operator O can be inserted anywhere in the 
product of time slice operators in (229), one can also average over all time slices and use 



0{{a}) = jY,0{ai), (231) 
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which normally improves the statistics of a simulation. In practice, to save time without 
significant loss of statistics, one may only perform a partial summation in (231) over, 
e.g., every Mh time slice (normally L > N), because eigenvalues 0{ai) and 6>(a;+i) of 
nearby states differ very little (at most corresponding to one single world line jump). 
States separated hy N slices typically differ significantly and contribute independent 
statistics, at least to some degree (exactly how much can be statistically quantified in 
terms of an imaginary-time dependent correlation function (O(t)O(O))). 

The kinetic energy. Expectation values of off-diagonal operators are in general more 
complicated. The kinetic energy (K) is an exception. In addition to being easy to 
evaluate, it is also a quantity of particular interest for understanding the properties of the 
path integral. At the level of the linear approximation of e~^^^ in Z, we can approximate 
j^^-ArH j-j^g form (229) with O = Khy just K, because this is done only at a single 
time-slice and leads to a relative error of order A-j:. This is of the same order as the total 
error from the linear approximation made at all the other time slices in both (224) and 
(229). Given a configuration {a} that contributes to Z, the estimator for a specific kinetic 
operator Kij is then 

'^^^ ^' (ai|l-A^i^|ao) 

Here the numerator is non-zero only if there is a world line jump between sites i and j at 
the first time slice. In that case Kij{{a}) — l/A^ = L/j8. In all other cases the estimator 
vanishes. We can again average over all time-slice locations of the operator Kij in (229), 
which results in 

(^,7) = ^, (233) 

where nij denotes the number of kinetic jumps in the world line configuration between 
sites / and j. Thus, the total kinetic energy is given by the average of the total number of 
kinetic jumps nK\ {K) = —{nKi/fi- 

We could have derived the expression for the kinetic energy in a simpler way, by ap- 
plying the thermodynamic formula for the internal energy (here just the kinetic energy); 
E = d ln(Z) /^j8 , with Z given by Eqs. (226) and (227). The more complicated derivation 
illustrates explicitly how the form (230) involves matching the configuration spaces of 
the numerator and denominator. This matching is not possible for generic off-diagonal 
operators; only for ones that are part of the hamiltonian. Before considering other cases, 
let us discuss another important aspect of the kinetic energy estimator. 

The utility of the expression (233) is not just that it enables us to compute the kinetic 
energy. It also carries with it a fundamental message about the path integral and the 
nature of the world lines. It is natural to ask how typical world lines will evolve as we 
increase the number of time slices. Specifically, how many kinetic jumps can we expect 
in a typical world line configuration? 

At low temperatures the kinetic energy should be almost temperature independent 
(approaching a constant as j8 — )■ °o). It should be proportional to the lattice size N. Using 
Eq. (233), we can therefore deduce the expected size and temperature scaling of the 
number of kinetic jumps as 

{nK) = -{K)^^N^. (234) 
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FIGURE 52. Continuous-time world line configurations. Here the kinetic events (jumps) occur at 
arbitrary imaginary times < T < j3. Configuration (a) contributes to the partition function, whereas 
(b) includes a pair of creation (solid circle) and annihilation (open circle) operators separated by more 
than one lattice spacing and does not contribute to Z or diagonal expectation values. Such a configuration 
instead contiibutes to an off-diagonal expectation value (alaj). 



This tells us that there is a typical scale of roughness of the world lines; for large L, the 
typical time separation between jumps is ~ 1 /N. As mentioned in the beginning of the 
section, we should expect to use of the order Nji time slices in a discrete path integral 
with the linear approximation of the time slice operator to avoid errors growing with A'^ 
and /3. We now have the explanation for that, because it is clear from (234) that a smaller 
number of time slices cannot accommodate the number of kinetic jumps necessary for 
the correct behavior of the kinetic energy (and then indirectly any other quantity). 

The relationship between (nx) and (K) is also important from another perspective; 
it indicates that it should be feasible to formulate simulation algorithms directly in the 
continuum limit — even if we let the number of time slices L — )■ oo, the number of kinetic 
jumps ("events") in the contributing configurations stays finite. The continuous -time 
world lines can then be stored in the form of just a single state |ao) and the nfc events; 
the times T,, / = 1 , . . . , at which they occur and the direction of each jump. With {n^) 
scaling as it should be possible to construct Monte Carlo sampling algorithms with 
this scaling in system size and temperature of the time and memory requirements. 

Examples of world line configurations in continuous time is shown in Fig. 52. The 
configuration to the left contributes to the partition function, whereas the one to the right 
does not, because it does not satisfy all time-periodicity constraints. It instead contributes 
to the expectation value of an operator afaj, the Fourier transform of which is the 
momentum distribution function p{k) = (a^ak). We briefly discuss such off-diagonal 
quantities next. 

Off-diagonal operators and broken world lines. If we consider a path {a} contribut- 
ing to Z and proceed to treat the expectation value of a general off-diagonal operator 
afaj in the same way as we did for the kinetic terms in (232), it is clear that we always 
get zero, unless z, j are nearest neighbors (in which case the operator is part of the kinetic 
energy). On the other hand, we can construct other paths, which do contribute to (a^aj) 
but not to Z, like the one in Fig. 52(b). If the sites /, j are separated by more than one 
lattice spacing, the expectation value (afaj) cannot be written in the standard Monte 
Carlo sampling form (230), because VK({a}) = for all 0{{a}) ^ 0. It is still possi- 
ble to evaluate {a'^at), as well as the corresponding imaginary-time dependent function 
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(a^(T)ayt(O)) (the single-particle Green's function) by working in the combined space 
of the periodic world line configurations contributing to Z and those with two defects 
corresponding to the presence of a creation and annihilation operator [32]. 

The path integral including interactions. It is not difficult to generalize the bosonic 
path integral discussed above to hamiltonians including potential-energy terms in ad- 
dition to the kinetic energy K in (225). Let us denote by V any interactions that are 
diagonal in the occupation number basis, e.g., attractive {vij < 0) or repulsive (vij > 0) 
terms of the form VijUiUj. With H = K-\-V can decompose exp(— A-j//) as 

e-A,// ^ g-A,^:g-A,v ^ ^(^2) ^ (235) 

where the error is due to the commutator [K, V] ^ 0. With V diagonal, we can for each 
time slice in (223) write 

(a;+i|e-^^^|a;) ^e-^^^'(a/+i|e-^^^|a;), (236) 

where V/ denotes the potential energy evaluated at the Zth time slice. The weight (227) 
of a world line configuration is therefore modified in the presence of interactions as 

W{{a]) = A^'^exp ||-a/£ j . (237) 

The interaction part of the weight can be easily taken into account in elementary world 
line moves of the kind shown in Fig. 51, and also in more sophisticated treatments in 
continuous imaginary time [32]. 

The continuum limit. We have already discussed the fact that the continuum limit 
of the path integral can be used directly in QMC algorithms. However, examining the 

configuration weight, (227) for the purely kinetic hamiltonian or (237) in the presence 
of interactions, we have an apparent problem; the weight vanishes as Ax 0. This must 
clearly be compensated in some way by the number of configurations increasing, in such 
a way that the partition function remains finite. In Monte Carlo calculations we do not 
deal with the partition function directly and only need ratios of configuration weights 
to compute acceptance probabilities for world-line updates. Looking at such ratios for 
the simplest kinds of updates, the insertions and removals of opposite kinetic jumps 
illustrated in Fig. 51, they are also problematic in the continuum limit: Formally the 
probability for insertions and removals is zero and infinity, respectively. This is not just 
a problem in the continuum, but also for small A^, where the probability of accepting an 
insertion would be non-zero but very small. 

Proper ways to handle the continuum limit were introduced in the context of "worm" 
[32] and loop algorithms [179], twenty years after simulations based on discrete path 
integrals were first introduced [177, 178]. Here we only consider local world line moves 
for a toy model, as an illustration of how Monte Carlo sampling can in fact rather easily 
be formulated in the continuum. 

Fig. 51 shows two different ways to introduce two events moving a world line to 
and from a neighboring site. Consider a line segment which is initially straight over m 
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(a) 



(b) 



'^2 



FIGURE 53. Local world line moves in continuous time. In (a), two opposite kinetic jumps are inserted 
at randomly chosen times Xa and T/, between two existing jumps at Ti and T2. In (b), two opposite jumps 
are either removed (^) or the times of the two events are randomly changed to new arbitrary times z'^ and 
between Ti and T2 (^')- The acceptance probabilities are given in Eq. (238). 



time slices, between times Ti and T2, with T2 = '^1 +wAt:. Assuming that there is no 
world line on the neighboring site within the range [ti,T2]5 the weight change when 
introducing the two events is (for a purely kinetic hamiltonian), regardless of where 
in this range the events occur. There are a total of m(m — l)/2 different ways of inserting 
events, which gives a total relative weight A^m(m — l)/2 for the subspace of such 
modified configurations, versus 1 for the original configuration with two less events. In 
the continuum limit, the weight for all the possible updated configurations of this kind 
becomes (T2 — Ti)^/2. As we have discussed above, when A-r — > 0, the average total 
number of kinetic events remains finite. The typical length T2 — Ti of a straight world 
line segments then also remains finite, and we should be able to construct a scheme with 
finite insertion probabilities. The key is that we do not just consider a single specific 
updated configuration, but a continuous range of possible updates. 

We also need to consider the removal of two events. In order to simplify the discus- 
sion, we here consider just a single boson on two sites. This avoids the complications 
of having to consider also constraints imposed by world lines on neighboring sites. The 
world line moves for which we will construct probabilities satisfying detailed balance 
are illustrated in Fig. 53. The event insertion discussed above is illustrated in (a), where 
it should be noted that the times X\ and T2 correspond to two consecutive existing events 
(and in case there are no events, we take Ti = and T2 = We call the times of the two 
new events Xa and T^, and these are chosen at random anywhere between X\ and T2. In 
the opposite update of removing two events, we consider the total weight of two existing 
events Xa and within two surrounding events at Ti and X2- Then we can again compute 
the total weight of the existing events, not only at the fixed current times Xa and but 
at any times x'^ and x'^ within [ti, T2]. That relative weight is again (T2 — Ti)^/2, versus 
1 for the single configuration with those two events absent. 

Thus, once Ti and T2 have been identified by inspecting the current configuration 
(which can be stored in the form of a list of events), the acceptance probabilities for 
two-event insertion and removal are 

{X2-X1Y/2 _ 1 

^--rt - l + (^2-Tl)2/2' ~ l + (T2-Ti)2/2- ^^^^^ 
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For an accepted insertion, we generate Xa and T^, at random, while in a rejected removal 
we also generate new times and for the existing events. 

In a larger system, with more than one boson, the above scheme would have to be 
modified to take into account the constraints of other world lines when a chosen world 
line is moved. The times Ti and T2 should then reflect those constraints. In the presence 
of interactions, the acceptance probabilities would be modified to take into account the 
potential-energy factor in (237). In practice, one would not use these local updates, 
however, as loop and worm updates [32, 179, 33] are not much more complicated to 
implement, but much more efficient. The purpose of the discussion here was to show 
how the apparent problems of the continuum limit can be overcome in principle. 



5.1.2. The Suzuki- Trotter decomposition 

While modern world line QMC algorithms for lattice models are normally based on 
the path integral in the continuum limit, historically time-discretized variants based on 
a Suzuki-Trotter approximation of the time-slice operator exp{—AT:H) were developed 
first. The discrete approach is still some times used and is the most practical option in 
some cases [197]. We here discuss the main features of Suzuki-Trotter based methods 
and how they are applied to boson and spin systems. 

One example of a Suzuki-Trotter (or split operator) approximation [177, 198] was al- 
ready written down in Eq. (235). This is just one among many possible decompositions 
of an exponential of two (or more) non-commuting operators into a product of exponen- 
tials [199]. The most commonly used approximants are, for any pair of operators A and 
B and a small factor A; 

.A(A+B)_ / e^e^ + ^(A2) 



e - [ eAB/2gAAgAB/2 ^ ^(^3) _ (239) 

Here the errors are also proportional to the commutator [A,B]. Using a larger number 
of judiciously chosen exponentials of functions of A and B, the remaining error can 
be further reduced, in principle to an arbitrary high power of A [200]. High-order 
approximants are often too complicated to work with in practice, however. 

If we are interested in the trace of a product of exponentials, as in the path integral, it 
is easy to see that the two low-order approximants (239) are actually equivalent, because 

Tr |ne^/2eAAgAB/2| ^ jr jne^e^j , (240) 

due to the cyclic property of the trace. Thus, although the world line method is often 
discussed based on the first line of Eq. (239), where the error from each factor is A^, 
the error is in effect smaller; oc a\. However, since the number of exponential factors in 
the path integral is L = /3/Af , the total error made in a product such as (240) for fixed A^^ 
and j8 is °« A^. In addition, one can show that for sufficiently large L, the error also does 
not grow with N at fixed A-^. In most cases, one can therefore keep A^ independent of 
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N and /3 (although for some classes of expectation values, care has to be taken to avoid 
error divergences when j8 — )■ 0° [201, 202]). 

In the context of path integrals, the utility of the Suzuki-Trotter approximation is 
that even though exp{—A-^H) in the partition function (223) cannot be easily evaluated, 
if we find a suitable decomposition of H into two terms (or some small number of 
terms), H = Ha +Hb, we may be able to evaluate exp(— A^/Z^) and exp(— A^/Zs) exactly. 
The prototypical example is a one-dimensional system containing only nearest-neighbor 
kinetic energy and interactions, which we collectively denote H^+i for a site pair /, / + 1. 
We then decompose H = Ha+Hb according to 

Ha=Y, Hi,i+U Hb= Hi,i+i, (241) 
odd i even (' 

With this arrangement, all terms in each of the sums Ha and Hg are mutually commuting; 
[Hi^i^i, Hi^2m.i+2m+i] = for any i and m. We can therefore split exp(— A^/Za) and 
exp(— A-ir/Zfi) into two-body operators without making any further errors, e.g., 

g-AfZ/A _ g-Ai:i/l,2g-AT//3,4 . . .g-ATifiv_i_jv (242) 

Using products like this in the time-sliced partition function (223), we can insert com- 
plete sets of states between all the exponentials. Since each operator now involves only 
two sites, the matrix elements reduce to the form 

(j8|e-^^^''^|a) = {npi,npj\c-''^"^j\naunaj), (243) 

which for hard-core bosons are the elements of a 4 x 4 matrix. Since the bond operator 
Hi j conserves the number of bosons on the two sites involved, this matrix is bock- 
diagonal, with two single-element blocks (for nai = Uaj = n^i = n^j) and a 2 x 2 matrix 
with both diagonal and off-diagonal elements connecting the states with a single boson 
on the bond. It is thus easy to evaluate all the matrix elements, also if interactions are 
included (as long as they do not extend beyond nearest neighbors; otherwise a more 
complicated decomposition of H has to be used). 

Pictorially, each matrix element (243) corresponds to a four-site plaquette with zero, 
one, or two world line segments going through it, according to the occupation num- 
bers. Considering all time slices and all site pairs 1 in both exp(— At://^) and 
exp(— A^/Zs), these plaquettes form a checkerboard pattern, with two adjacent rows cor- 
responding to one time slice, as illustrated in Fig. 54. For obvious reasons, the hamilto- 
nian decomposition (241) is also often called the checkerboard decomposition. 

For a hamiltonian conserving the total number of particles, the constraints on allowed 
world line configurations are similar to those in the linear approximation of the time- 
slice operator, with a few important modifications. Kinetic jumps are allowed only on 
the shaded plaquettes in Fig. 54, but there is no further constraint on the number of jumps 
within a time slice, unlike in the linear time-slicing approximation illustrated in Fig. 50 
(where there can be at most one jump in each time slice). The world line configurations 
still look very similar. They become equivalent when A-;: — )■ 0, in the physically relevant 
sense of their evolution from some time x to some later x', with x' — x^ A^. 

In a Monte Carlo simulation, the world lines can be updated using simple moves 
of the kind illustrated in Fig. 51, with the constraint that the diagonal loop segments 
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FIGURE 54. AID world line configuration based on the checkerboard decomposition with the Suzuki- 
Trotter approximation. Kinetic jumps of the bosons (or flips of a pair of ^ and | spins) are allowed only 
across the shaded squares (plaquettes). A time slice of width consists of two consecutive rows of pla- 
quettes. The six isolated plaquettes shown to the right correspond to the non-zero matrix elements, which 
in the case of a spin model with Heisenberg interactions (for world lines and empty sites corresponding 
to t and I spins, respectively) are given by Eq. (244). 



(kinetic jumps) are allowed only on the shaded plaquettes in Fig. 54. More complicated 
"loop" and "directed loop" updates, in which large segments of several world lines can 
be moved simultaneously, are used in modem algorithms [31, 191, 33] (which we will 
discuss in detail below in the context of the stochastic series expansion method). 

Application to the Heisenberg model. It is useful to consider a particular example 
of the path weights in the Suzuki-Trotter approach. Let us compute the plaquette matrix 
elements for the antiferromagnetic Heisenberg interaction; = S, ■ S,+i. In this case 
the boson occupation numbers in (243) are replaced by spin states t and |. We can 
consider the world lines forming between the f spins (and note that we could also draw 
world lines for the 4 spins in pictures such as Fig. 54; they occupy all sites not covered 
by t world lines and cross those lines at each diagonal segment). The calculation just 
involves straight-forward algebra and we just list the results for the six allowed (non- 
zero) matrix elements; 

+e-A./4 

+e^^/'^cosh(A^/2) (244) 
-e^^/Sinh(A^/2) 

The weight of a world line configuration is a product of these matrix elements, all of 
which are pictorially represented in the right part of Fig. 54. Note the minus sign in 
front of the off-diagonal matrix elements. For an allowed world line configuration, all 
the signs cancel out due to the periodicity constraint on the world lines. This is true 
also for world line methods applied to bipartite lattices in higher dimensions, but for 
frustrated systems there is a "sign problem" because of the presence of both negative and 
positive weights (as we will discuss further in Sec. 5.1 .3). In practice, world line methods 
and similar QMC approaches are therefore useful primarily for studies of bipartite spin 
systems and bosons models. For a fermion system, permutation of world lines also 
lead to sign problems, except in one dimension where only global cyclical permutations 
(winding) are possible (with associated signs that can be avoided by choosing periodic 
or anti-periodic boundary conditions [176, 187]). 
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In higher dimensions one can use the Suzuki-Trotter approach with various decom- 
positions analogous to the one discussed above. Local updates such as those we have 
discussed for ID systems can be adapted to higher dimensions as well. It is worth read- 
ing Refs. [79, 203], which are two pioneering world line studies that firmly established 
the basic properties of the 2D 5 = 1/2 Heisenberg antiferromagnet. 

Although the number of time slices used in the Suzuki-Trotter approach does not have 
to depend on the system size (because <x N events can take place within a single time 
slice), the computational effort of Monte Carlo sampling the world-line configurations 
scales in the same way as in the continuous time formulation, as N^. This scaling is 
achieved in the exact continuum formulation by only storing and manipulating the times 
of the events, whereas in the Suzuki-Trotter approach one normally works with the full 
space-time lattice (although one could also in principle only store the events there). For 
this reason, the continuous-time methods actually normally run faster on the computer, 
in addition to not being affected by discretization errors. The only reason to use a discrete 
path integral would be in problems where it is difficult to take the continuum limit in 
practice, e.g., in effective models including dissipation, which leads to time-dependent 
interactions among the world lines [197]. 



5.1.3. The series expansion representation 

As an alternative to the discrete time-slicing approach or continuum limit of the path 
integral, one can also construct a configuration space suitable for Monte Carlo sampling 
by using the Taylor expansion of the Boltzmann operator; 

^-HH ^ £ tP)lH\ (245) 

n=0 

This approach was pioneered by Handscomb [180, 181], who developed a method for 
studying the S = 1/2 ferromagnet. The Taylor expansion was later considered more 
generally as a starting point for exact QMC algorithms for a wide range of models [182, 
185, 186, 187]. The power-series expansion of the exponential operator is convergent 
for a finite lattice at finite j8 (which technically is due to the act that the spectrum of H 
is bounded). In effect, as we will see below, the series expansion allows for a discrete 
representation of the imaginary time continuum, thus avoiding approximations but fully 
retaining the advantages of a finite enumerable configuration space. 
Choosing a basis, the partition function can be written as 

^=L^-T~ L(«ol^l««-i)---(«2|^|ai)(ai|//|ao), (246) 

n=0 {a}n 

where the subscript on {«}„ indicates that there are n states to sum over. This expression 
can be compared with the linear-order discrete path integral (224). Taking H to be the 
bosonic kinetic energy in (225), we can clearly draw world line pictures very similar 
to those in Fig. 50 to represent the contributing terms. The main difference is that 
the number of "slices", the expansion power n, is varying and for given n there are 
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particle jumps at each slice. The new "propagation" dimension we have introduced 
in this representation is different from imaginary time, but it is clear that the label 

= 0, . . . , n — 1 of the states is closely related to imaginary time in the path integral. 
We will discuss the exact relationship further below. 

In the case of only kinetic energy, all matrix elements in (246) equal — 1 for an allowed 
configuration, the minus signs cancel and the weight is 

W{{a}n) = ^. (247) 
nl 

In more general cases, this weight will be modified by the product of matrix elements 
in (246). As in the path integral, the weights are positive definite (or can be made so 
with some simple tricks) for boson systems and quantum spins without frustration in the 
off-diagonal terms. We will here proceed under the assumption of positive definiteness 
and discuss the precise conditions for this further below. 

It is useful to derive a general expression for the total internal energy U = {H). For 
any hamiltonian, we can write it as 

^= zE^-T^ E (ao|^|a„)---(a2|//|«i)(ai|//|ao), (248) 

^n=0 {a}„+i 

where the presence of an additional matrix element of H in each term should be noted; 

for given n, the summation is over n + I states instead of the n states in the partition 
function (246). All configurations that contribute in (248) contribute also to the partition 
function, but the weights differ. We can match the configurations explicitly by writing 

^ = -7 E ^-T^ ft E (aol^l««) • • • (a2|^|ai)(ai|//|ao). (249) 

^ n=l P {a}n 

We can extend the sum over n to include also n = 0, because this term vanishes. 
The terms in (246) and (249) then match exactly, and n/j8 can be identified as the 
energy estimator. Thus, with the configurations sampled according to their weights in 
the partition function, the energy is simply given by 




(250) 



Writing as a sum, H = —Y^iHi, and using this for all instances of H in (246) and 

(248), we can derive a similar expression for the expectation value of an individual term 
Hi. The result for this estimator is the average number of times the operator appears in 
the operator string in the expansion of the partition function; 

{Hi) = (251) 

In the case of a kinetic-energy term, this is identical to the path-integral expression 
(233). This shows that there is a one-to-one correspondence between the paths in the two 
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formulations, if we by path mean just the order in which kinetic events occur, without 
reference to the times T; of the events in the discrete or continuous path integral. This 
correspondence holds quite generally. The difference is that in the series expansion, the 
potential-energy terms are treated in the same way as the kinetic operators (i.e., they 
are not re-exponentiated, as they are in the path integral). Their presence corresponds 
to "non-events", which do not affect the world lines but are associated with diagonal 
matrix elements that modify the weight of the paths. In the path integral, there is instead 
the potential-energy factor in Eq. (237), which depends on the actual time-points of the 
kinetic events. 

Eq. (250) shows that although the sum over expansion orders n in (246) extends up 
to n = oo, it can in practice be truncated at some order rimax °= N^. To further motivate 
why this will only lead to an exponentially small and completely negligible error, one 
can make use of the estimator for the specific heat, obtained by taking the temperature 
derivative of (250); 

C={n^)-{nf-{n). (252) 

When the temperature T — )■ 0, C should vanish, and (252) then shows that the variance 
of the distribution of n equals (n). It is therefore clear that the distribution vanishes ex- 
ponentially beyond some power n oc . In practice, QMC (stochastic series expansion, 
SSE) algorithms based on this representation automatically sample the n-distribution 
according to the relative weights of the different n sectors. 

By explicitly truncating the Taylor expansion, one can make the relationship between 
the series and path integral representations even clearer. Truncating at n = L, we can 
formally construct a fixed-size sampling space by augmenting all powers H" with n<L 
hy L — n unit operators /. Allowing for all possible locations of the unit operators in 
the product of L operators, we can define a modified operator string S = Si,S2,....,Sl, 
in which Si G {HJ}, and do a summation over all these sequences. We then have to 
compensate the weight by the number (^) of equivalent terms, giving 

( — B)"(L — n)^ 

L £ £(ao|5L|aL-i)---(a2|52|ai)(ai|5i|ao), (253) 

S {a}L{Si} 

where n now refers to the number of elements 5,- = H in the operator string (and we 

no longer need an explicit sum over n). If we now consider the boson kinetic energy 
(225) and take the limit L — )■ oo, the weight reduces to (j8/L)", which is the same as 
the path integral weight (227) for L time slices. In this limit, it is clear that the index 
p = 0, . . . ,L — 1 is related to imaginary time according to T = pjS /L. The difference 
is that the series expansion with the full weight, j8"(L — n) !/L!, is in practice exact 
for L I5N, whereas in the path integral we have to take the continuum limit to avoid 
a discretization error. For finite L, the series index p does not correspond exactly to 
imaginary time, but represents a distribution of imaginary times [187]. Because of the 
close relationship between the propagation index p and imaginary time, it is appropriate 
to refer to the p space as the "time" dimension also in the series expansion. 

The practical advantage of the series expansion is that it provides an exact but discrete 
representation of the imaginary time continuum. Algorithms based on it are normally 
easier to implement than continuous-time world lines and can be more efficient compu- 
tationally. On the other hand, for a hamiltonian with large potential-energy terms, the 
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number of operators {n) in the expansion can be much larger than the number of ki- 
netic events {hk) in the path integral. The latter approach should then be more efficient, 
although an exact treatment necessitates manipulation of floating-point time variables. 
For quantum spin systems, there is in general a favorable balance between kinetic (off- 
diagonal) and potential (diagonal) energies, and the series expansion is then typically 
preferable. Next, we proceed to develop a QMC algorithm based on it. 



5.2. SSE method for the 5 = 1 /2 Heisenberg model 

We now discuss in detail how to implement the SSE method for the 5 = 1 /2 Heisen- 
berg model. Initially we do not have to specify the lattice and consider the hamiltonian 
written as a sum of bond operators. 

Hi, = Jb^i{p) ■ ^j{p) , (254) 

as in (204), with the lattice encoded as a list of sites [i{b)J{b)\ connected by the bonds, 
b= 1 , . . . , A^fo. An example of a labeling scheme in 2D was given in Fig. 46. 

A positive-definite SSE can be constructed for any bipartite lattice, i.e., when all sites 
i{b) and j{b) belong to sublattice A and B, respectively, in any number of dimensions. 
This constitutes a large class of interesting and important systems. Initially we will con- 
sider just a single antiferromagnetic coupling constant 7^ = 7 > 0, but the modifications 
needed for non-uniform systems are very simple. 



5.2.1. Configuration space 

It is useful to subdivide the Heisenberg interaction (254) into its diagonal and off- 
diagonal parts in the standard basis of diagonal z spin components. We then define 
operators with two indices, Ha^i,, with a = 1,2 referring to diagonal and off-diagonal, 
respectively, and b= 1, ... jA'/, is the bond index as before; 

H2,b = '2iSt,)SJ^b) + S7ib)Sm)- (256) 

Here we have also introduced a minus sign and a constant in the diagonal operator, so 
that the full hamiltonian can be written as 

H=-jY^{Hi,i,-H2,b) + -^. (257) 
b=l ^ 

The reason for including the constant in Hi^h is to make the series expansion positive- 
definite, as we will see shortly. The constant JNi,/4 is irrelevant in the algorithm but we 
will include it when calculating the energy. 
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Series expansion of the partition function. The general form of the partition function 
in the SSE approach can be written as (246), but here we will initially not write out all the 
complete sets of states inserted between the operators. We instead focus on the operators, 
expanding all instances of H as sums over all the bond operators in (257). The starting 
point of the SSE algorithm for the Heisenberg hamiltonian is thus 




(258) 



Here J has been absorbed into /3 = J/T, S„ refers to the products (strings) of the 
Heisenberg bond operators (255) and (256) originating from H", 

5„ = K0),K0)],Kl),Z7(l)],...,Kn- !),%-!)], (259) 

and n2 is the total number of off-diagonal operators, i.e., the number of elements with 
b{i) = 2 in the string. Note that the label p referring to the position of an operator in 
the string here takes the values 0, . . . , n — 1; this labeling will be more convenient than 
p= 1 , . . . , n in the program implementation. 

When the operator string acts on the state \a) = I^J, . . . , 5^) we get a succession of 
other basis states, with no branching into superpositions of more than one state. We will 
refer to these as propagated states \a{p)), 

p-i 

l«(p))-n^a«,MOl«)- (260) 

i=0 

We have not written down the normalization of these states explicitly but will consider 
all \oc{p)) as properly normalized. These states of course correspond to those in the 
summations over complete sets in (246), but in practice the SSE approach is framed 
around the summation over the operator strings in (258), and we do not always have to 
write out the states explicitly. 

With the constant 1/4 in (255), all operations on parallel spins destroy the states; 

Hi,b\ U{b)lj{b)) = 0) H2,b\ li{b)lj{b)) = 0- (261) 

An operator- state configuration (ct,5„) contributing to Z thus has to involve only op- 
erations on anti-parallel spins, and the propagated states (260) are defined under this 
assumption. The corresponding matrix elements are 

{'ti{b)ij{b) \Hi,b\ 1'i{b)ij{b)) = J, {ii{h)tj{b) \H2,b\ ti{b)ij{b)) = h 

{U{b)1'j{b) \fii,b\ ii{b)tj{h)) = \-, {'\i{b)ii{b) \H2,b\ li{b)'\j{b)) = 5- (262) 

The fact that all these are equal will be very useful in the sampling algorithm, and this 
was the reason for including the constant 1/4 in the diagonal operators. 

In addition to the local constraints of only operations on anti-parallel spins, for the 
matrix element of the full operator product in (258) to be non-zero, the propagation also 
has to satisfy the periodicity \oc{n)) = \ a{0)), where \ a{0)) — \ a). 
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Although one can formulate Monte Carlo sampling procedures in the space of oper- 
ator strings of fluctuating length n [180, 187, 204], it is normally in practice somewhat 
easier to work within the fixed-size truncated space discussed in the previous section 
(and mathematically it is also easier to prove detailed balance in such a space). ^ In ac- 
tual simulations, the cut-off L will be determined automatically by the program, as we 
will discuss below, such that L safely exceeds the largest n ever sampled. Note again that 
the truncation then does not cause any detectable errors and should not be considered as 
an approximation. Defining the unit operators used for augmenting strings with n < L as 
Hq_q = I and including [a{p) , b{p)] = [0, 0] as an allowed element in the index list (259), 
the partition function can be written as 



a Sl ^■ 



L-1 
p=0 



a , (263) 



where n refers to the number of non-[0, 0] elements in the fixed-length operator string Sl 
(and n is summed implicitly by the sum over Sl). Since all matrix elements equal 1/2, 
the weight of an allowed configuration is given by 

W(a,S.)=(|)"(^. (264) 

Here we have left out the sign factor (— 1)"^, which, as we will discuss further below, 
always is positive for a bipartite (unfrustrated) system. Note also that L! is an irrelevant 
normalization factor and can be left out as well. 

With t and 4- spins represented by the presence and absence of a hard-core boson, 
respectively, we could draw pictures like Fig. 50 to illustrate the rules for the contributing 
SSE configurations. However, we now also have diagonal operators that do not change 
the world lines but modify the path weight. In the path integral approach the diagonal 
operators are all brought together into a common exponential factor, Eq. (237), but 
there is no simple way to do this in the series expansion. As mentioned above in 
Sec. 5.1, the SSE approach should not be used when the diagonal energy dominates, 
but in the case considered here we have (//i,/,) ~ (^2,z?) and the diagonal operators do 
not dominate. As we will see below, the presence of a significant number of diagonal 
operators is in fact exploited in the SSE algorithm. When discussing the SSE method, 
it is useful to draw pictures showing explicitly the diagonal and off-diagonal operators. 
An example is given in Fig. 55, which includes also illustrations of the data structures 
that we will later use in a computer implementation of the method. The states will be 
represented by integers C7(/) = ±1 corresponding to = ±1/2. For compact storage of 
the operator string, it is also useful to combine the indices [a{p),b{p)] into a single list 
of integers s{p) — 2b{p) +a{p) — 1. The full propagated states do not have to be stored 
simultaneously. They can be generated as needed from the single stored state |ot(0)) and 



° A case where it is clearly better to work with sequences of fluctuating length n is when the SSE approach 
is combined with a multi-canonical ensemble, in which a range of temperatures is sampled and therefore 
n can fluctuate over a very wide range [204]. 
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FIGURE 55. An SSE configuration for an 8-spin chain, with all the propagated states shown. Open and 
solid bars indicate diagonal Hi j and off-diagonal //2.1 operators respectively, while no bar between states 
corresponds to a "fill-in" unit operator Hq q. The f and | spins of the state \a) are stored as a{i) = ±1, 
and the operator string Sl is encoded using even and odd integers for diagonal and off-diagonal operators, 
respectively, according to s{p) = 2b{p) +a{p) — 1. 



A- A- A- A- A 

FIGURE 56. An example of three off-diagonal operations (indicated by bars) bringing all spins on a 
triangle back to their original states. Each spin flip is associated with a minus sign, resulting in a negative 
path weight and a "sign problem" (due to cancellations of configurations with different signs) in QMC 
simulations of this and other frustrated systems. 



the operator string. We will later introduce a different compact storage involving some 
spins of the propagated states as well. 

Frustrated interactions and the "sign problem". At first sight, it appears that we 
have a sign problem — a non-positive definite expansion — because of the factor ( — 1)"^ 
in (258). Actually, all the terms are positive for a bipartite lattice. This is because an 
even number ^2 of off-diagonal operators are required in every allowed configuration, in 
order to satisfy the "time" periodicity \oc{L)) = |a(0)). We already discussed this in the 
context of the world line method, where the off-diagonal matrix elements in (244) are 
negative, but the continuity of the world lines require an even number of these. This is 
yet another example of the close relationship between the two approaches. 

For frustrated systems, the series expansion is not positive-definite (and neither is the 
path integral, for exactly the same reason). This can be easily demonstrated for a system 
of three spins on a triangle. As shown in Fig. 56, an allowed configuration can in this 
case contain three off-diagonal operators, resulting in an over-all minus sign. This is 
true for any system in which loops with an odd number of sites can be formed between 
antiferromagnetically interacting spins — this can be used as the definition of frustration. 

Positive-definiteness for a bipartite system can also be proved in a different way, by 
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carrying out a unitary transformation of the spin operators on one of the sublattices, say 
B, such that St —Sj and SJ —SJ (and no change in the diagonal operators Sp for 
j e B. This does not affect the spectrum of the model (since the commutation relations 
among all spin operators remain unchanged), but the sign in front of the off-diagonal 
terms in the hamiltonian (257) changes to +. The factor (—1)"^ in (258) is then absent. 
For a frustrated system, no such transformation can remove all the signs. 

Note that only the off-diagonal part of the interaction causes a sign problem. One can 
study systems with frustration in the diagonal part, but then the interaction is no longer 
spin-isotropic. Just neglecting the sign for a frustrated Heisenberg system corresponds to 
antiferromagnetic diagonal couplings but ferromagnetic off-diagonal couplings. Here we 
only consider isotropic interactions and restrict the discussion to unfrustrated systems. 

For a bipartite system with periodic boundaries, the absence of sign problems holds 
strictly only if the lattice lengths Ly are even in all directions 7 = 1, . . . ,<i. Winding 
configurations, like the one shown for the path integral in Fig. 50(b), are negative if 
there is an odd number of windings around an odd number of sites. Clearly, in such 
cases there is also frustration, due to the boundaries. We will therefore consider only 
systems of even length (in all periodic lattice directions). 

Linked-vertex storage of the configurations. We have discussed computer storage of 
the SSE configuration as a state |a(0)) and an operator string S^. By acting sequentially 
with the operators, it is easy to generate all the propagated state \OL{p)) illustrated in 
Fig. 55. However, during the Monte Carlo sampling we will need to access operators 
and some properties of the propagated states also in non- sequential order — given an 
operator and the spins it acts on, we will need to to know which operators act on those 
spins next (when moving up as well as down in the operator sequence). It would be 
prohibitively time consuming to propagate a single state back and forth to extract this 
information, and also it would not be practical to store all the propagated states. We 
therefore use also another kind of data structure, in which the "connectivity" of the 
operators is explicit and represented as a network in a compact way. This linked vertex 
structure is illustrated in Fig. 57, using the same SSE configuration as in Fig. 55. Here, 
in the pictorial representation, the constant state of each spin between operators have 
been replaced by straight lines, which in a computer program will correspond to links 
(pointers). The operators are shown along with only the two spin states before and after 
each operator has acted. We will call these spin-operator objects vertices and refer to the 
four spins as the legs of the vertices. One stage of the Monte Carlo sampling procedures 
will involve making changes to the vertices (while their locations are changed at another 
stage). The links will allow us to quickly move between the vertices and make a series 
of changes maintaining all the constraints. 

The allowed vertices are dictated by the hamiltonian. In the case of the isotropic 
Heisenberg system considered here there are four of them, depicted in Fig. 58. They 
of course correspond to the non-zero matrix elements (262), and, again, the constant \ 
in the diagonal operator is the reason why there is no vertex with all four spins equal. 
Anisotropic interactions or an external magnetic field would necessitate inclusions of 
additional vertices [190, 33] (and the algorithm discussed here would then also have to 
be modified). Although the spin states at the four legs uniquely identify the vertices, we 
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FIGURE 57. Linked vertex storage of the configuration in Fig. 55. In the graphical representation to 
the left, constant spin states between operators have been replaced by lines (links) connecting the spins 
just before and after the operator acts. The links can be stored in a list X{v), where the four elements 
V = 4p + l, I = 0,1,2,3, correspond to the legs (with the numbering convention shown in Fig. 58) of the 
vertex at position p in the sequence Sl. For two linked legs v and v', X{v) = v' and X(v') = v. 
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FIGURE 58. Allowed vertices for the isotropic S =1/2 Heisenberg model. The numbering / = 0, 1 , 2, 3 
of the vertex legs corresponds to the position v = Ap + 1, in linked-list storage illustrated in Fig. 57. 



will continue to use also the open and solid bars in pictures, to indicate diagonal and 
off-diagonal vertices, respectively, for added clarity. 

For a given position p in the operator sequence St, the corresponding list element 
s{p) tells us the operator type (diagonal or off-diagonal) and the bond b on which it 
acts (as explained in Fig. 55). As will become clear below, along with this information, 
we only have to store the connectivity of the vertices, not their spin states. The links 
allowing us to jump between connected vertex legs are stored as a list X(v), as explained 
in Fig. 57. For clarity of the illustration, the one dimensional list has here been arranged 
in four columns, with elements labeled v = Ap + l, corresponding to each type of leg, 
/ = 0, 1,2,3, with the labeling specified in Fig. 58. We will later describe an efficient 
way to construct this linked list, given the operator sequence. For now, it is sufficient to 
know that for a given operator at location p in the sequence, the position of its Z:th leg 
in the linked vertex list is v = 4p -|- Z. This leg is linked to another vertex leg with list 
address v' =X (v) . This kind of structure constitutes a doubly-linked (bi-directional) list, 
with X \X{v)] = V, in which we can move both "up" and "down". From a position v in the 
list we can extract the corresponding operator location in 5^, p = v/4 (its integer part) 
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FIGURE 59. (a) A pair of off-diagonal operators, indicated by solid-line boxes, which can be replaced 
by diagonal ones if the spins between the two operators are flipped as well. Such an update cannot be 
done with the operators enclosed by dashed boxes, because of the illegal spin configuration (a vertex with 
all four legs in the same spin state) that would result at the operator acting between the two boxed vertices 
on their left spin. However, as shown in (b), this operator pair can be changed if instead the spins on the 
opposite sides of the operators ai^e flipped. This also forces a change in the stored state \a). 



and leg index / = mod[v, 4] . We can move "sideways" to the other leg on the same vertex 
by changing the leg label o 1 or 2 o 3, which in both cases can be expressed as a 
simple rule of an evenoodd change in the list address v. These movements will allow 
us to construct closed loops of changed spins and operators. 



5.2.2. Monte Carlo sampling procedures 

When Monte Carlo sampling the partition function (258), we have to make changes 
in the operator sequence Sl as well as in the stored state | a) . It is clear from Figs. 55 and 
57, however, that updates in these two data structures are not necessarily independent. 
Allowed changes (i.e., ones maintaining all configuration constraints) in operators at 
locations p E {pi, . . . ,p2} lead to corresponding changes in the propagated states in 
the range \(x{pi + 1)), . . . , \a{p2))- Fig. 59 illustrates such an update of two operators, 
where only the types of the operators are changed, diagonalooff-diagonal (which leads 
to changes in the states similar to the world line update in Fig. 51). As the changed 
operator at pi acts on \a{pi)), the state |a(pi + 1)) and subsequent states also change 
(by two flipped spins in the example), but when the last updated operator at p2 has acted 
the modifications are "healed" and the resulting state |a(p2 + 1)) is the same as before 
the update. Since the propagated states form a cyclically permutable periodic structure, 
there is nothing special with the stored state |a) = \oc{0)) and it can change as well, 
depending on the locations of the updated operators. 

Because of the time (p) periodicity of \a{p)), the way state changes are forced by 
operator updates is not unique. In the above example, where we assumed P2 > Pi, 
we could also have acted first on \ a{p2)) with the new operator at p2. Then the states 
\(x{p2 + 1)), . . . , \a{L— 1)) as well as |a(0)), . . . , \a{pi)) would be affected. There are 
always two ways of changing the states in this way, but only one may be allowed in 
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FIGURE 60. Diagonal update. An operator is either inserted, [0,0]^ [^ib]p, at a randomly selected 
bond or removed, [\,b\p^ [0,0] p. In the former case, the update is canceled if the spins at the chosen 
bond are pai-allel, and therefore the state \a{p—\)) must be known. The acceptance probabiUties depend 
on the number of operators n in the sequence before the update attempt, according to Eq. (265). 

any given case because of the constraints, as illustrated in Fig. 59. In most cases, such 
updates done at random locations would not at all satisfy the constraints, and one has to 
specifically look for two (or more) operators that can be updated [205]. 

As seen in Figs. 55 and 57, the state |a) can some times be updated without any 
changes to the operator sequence — since there is no operator acting on the spin (7(1), 
its state is arbitrary. Such free spins are very rare at low temperatures, when the average 
number of operators on each spin is large. 

Focusing on the operator sequence, we will sample the number n of non-unit operators 
[a,b\p 7^ [0,0], their positions p G {0, . . . ,L— 1} in Sl, bond indices G {1, . . . ,Nh}, and 
types induces a G { 1 , 2} . The general strategy is to let the expansion order n change only 
in exchanges of diagonal operators with the fill-in unit operator; [0,0]p o [l?^]/? (where 
R — 7- n ± 1 ). Then, keeping all lattice locations b of the operators fixed, we change the type 
index, [l,i»]p •(-)■ [2,^?]^ for some set of operators (as in the example with two operators 
above, but we will do it in a much more efficient way involving an arbitrary number 
of operators). With these off-diagonal replacements carried out properly, the sampling 
is ergodic in combination with the diagonal n — )■ n ± 1 updates. We next consider the 
details of these updating procedures. We first assume that the expansion cut-off L has 
been properly determined and then discuss how this can be ensured in practice. 

Diagonal updates. Updates of single diagonal operators, illustrated in Fig. 60, can 
be carried out sequentially at the locations p = 0, . . . ,L — 1 in 5^. There are no constraints 
involved in such an update in the direction [l,b\p [0,0] p (removal of a diagonal 
operator), whereas an insertion of a diagonal operator, [0,0]p — [l,Z?]p, is allowed only 
if the spins on bond b are antiparallel, o{i{b)) ^ o{j{b)), in the propagated state \ ol{p)) 
on which the operator acts. We therefore have to generate these states (storing only 
the one currently needed), which is simply done by flipping the two spins o{i{b)) and 
a{j{b)) each time an off-diagonal operator [2,b]p is encountered (in which case no 
single-operator update can be carried out at p). 

The weight ratio to use in the Metropolis acceptance probability is easily obtained 
from (264). Note, however, that we also need to correct for the inherent imbalance of 
these update attempts — there is only a single unique way of carrying out the update in 
the direction [1 , Z?]^ — )■ [0, 0]^, whereas when changing [0, 0]p — )• [1 , b]p the bond b should 
be generated at random among the Nh possible locations [and if, for the chosen b, the 
spins o{i{b)) = o{j{b)), the update is immediately rejected and we proceed to the next 
p]. Therefore, following the criteria for detailed balance discussed in Sec. 3.2, specif- 
ically the transition probability (43) written as a product of selection and acceptance 
probabilities, we have to include the ratio A^^ of the selection probabilities for updates 
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involving a specific bond b. We then obtain the following acceptance probabilities; 



^accept([0,0]^[l,Z7]) = min 
/'accept([l,^]^[0,0]) = min 



■ 

2(L-n)' 
'2{L-n+l) 

, Wb 



(265) 
(266) 



where n is the number of operators before the update (and after n— orn— >n — 1). 

To prove detailed balance for the diagonal updates, it may seem necessary to carry 
them out at random positions p, not sequentially. Otherwise, after having carried out an 
update at some position p, there is zero probability of carrying out the reverse update as 
the next step. However, if we consider the whole sequence (sweep) of updates from p = 
to P = L— I and follow this by a reverse updating sequence, starting from P = L—\ and 
ending at p = 0, then detailed balance holds for the sweeps. In practice, one does not 
have to switch between the two directions of updating the operator sequence. 



Off-diagonal updates. Updates involving off-diagonal operators clearly have to in- 
volve at least two operators in order to maintain the periodicity constraint on the prop- 
agated sates. The simplest kind of pair update was already discussed and illustrated in 
Fig. 59. For a ID system, such updates are ergodic with open boundary conditions, but 
with periodic boundaries local updates cannot change the topological winding number 
(which in clear from Fig 50, where no local deformations can affect cyclic permutations). 
This may still not be a very serious problem in principle, because for any T >0 only the 
sector with zero winding number contributes when the system size — > oo (in practice 
for some large N, larger for lower T). If we are interested in ground state properties, we 
also can obtain the correct result when T ^ for fixed A^, because in this limit an ef- 
fective winding number defined within a finite range p,...,p-\-Dp, with Dp S> A^^, of the 
propagated states \ci{p)) (corresponding to a finite range of imaginary times) can still 
fluctuate freely, irrespective of the constraint of zero global winding number. Physical 
quantities are then insensitive to the winding number [206]. To converge to the ground 
state, much lower T has to be used, however. 

In addition to their inability to change the winding number in one dimension, in higher 
dimensions the kind of operator substitutions shown in Fig. 59 cannot lead to any of 
the permutations of same spins (permutation of particles in the boson language) that 
should be included (while in one dimension, cyclical permutations of all spins are the 
only possible permutations in the kind of system we consider here). To remedy this 
problem, one can construct other kinds of local updates, e.g., involving two operators 
acting on the comers of a plaquette of 2 x 2 sites on a 2D square lattice [205], which can 
sample among all such permutations within a given sector of winding numbers. Updates 
changing the topological winding number has to involve lines of sites wrapping around 
the boundaries, but for lattices of length larger than ^ 20 these targeted updates become 
very unlikely (i.e., it becomes too difficult to satisfy the constraints when a large number 
of operators are changed simultaneously) [186, 205]. 

Fortunately, instead of dealing with updates specifically targeting permutations and 
winding number sectors, there is a very efficient and simple type of loop update that 
accomplishes all these things automatically. This class of updates was initially intro- 
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FIGURE 61. A linked-vertex SSE configuration with one loop traced out and shown in both of its 
"orientations", along with the corresponding operator-index sequences. All spins covered by the loop are 
flipped, and operators are changed, diagonal o off-diagonal, each time the loop passes by (with no net 
change of an operator visited twice). Every vertex leg (spin) belongs uniquely to one loop, and spins not 
acted upon by any operator (here the one at / = 1) can also be regarded as forming their own loops. 



duced as a generalization of a cluster algorithm for the Ising model to a model where the 
flipped clusters take the form of loops; the classical six-vertex model [191]. The effec- 
tive world line system for the S = 1/2 Heisenberg model constructed using the discrete 
Suzuki-Trotter decomposition is exactly equivalent to an anisotropic six-vertex model, 
and the loop update for it was therefore at the same time a generalization of the clas- 
sical cluster update to a quantum mechanical system. These ideas were subsequently 
applied also to continuous-time world lines [179] as well as to the off-diagonal updates 
in the SSE method [190]. The improvements in performance relative to local updates 
are enormous (as in the classical case, leading to a much reduced dynamic exponent) 
and brought simulations of quantum spin systems to an entirely new level. Like classical 
cluster algorithms, the loop updates are in practice limited to certain classes of models, 
of which the isotropic Heisenberg systems is one. Generalizations of the loop concept to 
worms [32] and directed loops [33] (both of which can be regarded as loops that are al- 
lowed to self-intersect during their construction, unlike the original loop updates where 
no self-intersection is allowed) are applicable to a wider range of systems. 

For the 5=1/2 model considered here, there is no reason to even discuss local off- 
diagonal updates in any greater detail, and we will just focus on how to implement the 
much more powerful loop updates. In the case of the SSE method, the operator string is 
again the main focus, and the loop update corresponds to constructing a loop of operators 
(vertices) connected by the links in the linked-list representation. 

Operator-loop updates. An example of an operator-loop and how it is flipped is 
shown in Fig. 61. Here "flipping" refers to the spins along the loop (explicitly those on 
the vertex legs and implicitly in all propagated states covered by the loop) as well as 
the operators themselves when we map the changes back to the sequence Sl\ a diagonal 
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vertex with two legs attached to a loop changes to off-diagonal, and vice versa. In the 
case of all four legs of a vertex belonging to the same loop (and example of which is 
present in the figure), the operator type does not change. 

The key aspect of the loop update is that the weight (264) only depends on the number 
of operators, which does not change when the operator-loop is flipped. Therefore, 
according to the detailed balance rules, such a flip can always be accepted. It is also clear 
that loops can be large, and a loop flip can therefore lead to changes in the global spin 
permutations. The previously discussed pair substitutions correspond to flipping loops 
of two operators on the same bond. Any operator substitution specifically constructed 
to sample permutations can also be formulated as an operator-loop, including ones that 
change a winding number. 

Note that the loops are completely deterministic once the operator locations have been 
specified and each spin in the full space-time configuration belongs uniquely to one 
loop. The diagonal update is the de facto mechanism by which the loops are changed, 
and the purpose of the loop update is just to identify and flip some of the loops. It is 
then best to construct all the loops and to flip each of them with probability 1/2 (which 
also maintains detailed balance), instead of constructing loops at random and always 
flipping them (in which case some loops would be constructed more than once and 
flipped unnecessarily). Free spins, on which there are no vertices (such as the first spin 
in Fig. 61), can also be considered as loops, since flipping such a spin implicitly flips a 
whole line of spins in the full configuration of time-periodic propagated state. 



5. 2. 3. Computer implementation 

A convenient definition of a Monte Carlo sweep in the SSE method is a full sequence 
of diagonal updates, followed by construction (and flip with probability 1 /2) of all loops. 
From the loop illustration in Fig. 61, it is clear why the linked- vertex storage X() of the 
SSE configuration is useful; constructing a loop corresponds to moving in this list using 
very simple rules (following a link or moving laterally to the adjacent spin on the same 
vertex). On the other hand, in the diagonal update the original storage illustrated in 
Fig. 55, using a single spin state oi) and the operator string s(), is more convenient. 
The configuration will always be stored in this way, and the linked vertices will be 
constructed before each set of loop updates. The loop updates are carried out by moving 
in the linked list updating the corresponding operators in si) at the same time (as 
indicated in Fig. 57). The stored spins <7() can also be affected by the loop updates, 
and this can be taken care of after all loop flips have been carried out. We now discuss 
pseudocode implementations of all the main steps involved in a Monte Carlo sweep. 

Diagonal update. We assume that we currently have a valid configuration stored 
(which in the beginning of the simulation can be a random spin state and an operator 
sequence with only [0,0] elements), using a{i) = ±1 for the individual spins and the 
compact storage of [a,b\p as single integers s{p), with even and odd numbers 2b and 
2b + 1 corresponding to diagonal and off-diagonal operators, respectively (as illustrated 
in Fig. 55). A sweep of diagonal updates can then be implemented in the following way; 
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do p = to L — 1 

if(5(/7) = 0)then {27} 
& = random[l, . . . ,A^/,]; if C7(/(Z?)) = <7(j(^)) skip to next /? 
if (random [0 — 1] < ^insert («)) then s{p) — 2b; n — n + l endif 

elseif (mod[5(j!?),2] = 0) then 

if (random[0 — 1] < ^remove(«)) then s{p) = 0; n = n — l endif 

else 

b = s{p)/2; am) = -^m)-^ <y{m) = -<y{m) 

endif 
enddo 

Here Pmsert(«) and Premove(«) are the acceptance probabilities (265) and (266) for insert- 
ing and removing a diagonal operator when the current expansion order is n. These prob- 
abilities can be precalculated or evaluated on the fly. One can also just precalculate the n- 
independent constant ^N^ / 2 and accept a removal if random[0 — 1] x (L — n) < ^Nt, / 2 
and an insertion if random[0 — 1] x PNi,/2 < L — n+ I. This avoids the more time 
consuming divisions required with the full acceptance probabilities (265) and (266), or 
storage of a large number of the n-dependent probabilities. 

The sites belonging to bond b can be stored in a list [i{b)J{b)]. With the exception of 
this list, the SSE sampling algorithm is lattice independent. It is also easy to modify the 
above code for non-uniform (Z?-dependent) Heisenberg coupling strengths. The diagonal 
update is the only stage at which the interactions appear explicitly, and no modifications 
at all are required in the loop update. 

Construction of the linked vertex list. We will construct the linked list by traversing 
the operators s{p) , starting from p — 0. Recall that for vertex leg number / = 0, 1 , 2, 3 of 
an operator at p, the corresponding links are stored as X(v) with v — 4p + l,as illustrated 
in Figs. 57 and 58. To construct the list, we will make use of two other data structures. 
The position v in X() of the first vertex leg for spin / will be stored as Vfirst(0' while 
^iast(0 will be the position of the last leg on / encountered so far. Note that the first 
leg on a given spin is always below the operator (before the operator has acted, / = 0, 1), 
whereas the last leg is above the operator (/ = 2, 3). Initially all yfirst(i) and yiast(0 are set 
to —1. Later, when encountering an operator s{p) acting on spin / and V\ast{i) > 0^ then 
^lastlO is the previous list element corresponding to an operation on i. We then use the 
corresponding lower leg (/ = or 1, depending on which of the two spins acted on equals 
of the new operator and set the links between v = 4p + l and Viast(0; ^(^) = ^iast(i) 
and X(yiast(0) — ^- On the other hand, if Viast(0 = then we have found the first 
operation on / and set yfirst(0 = v. In both cases, we set the element for the last operation 
on i as V[ast(0 =v + 2, where the addition of 2 corresponds to the vertex leg on spin i 
after the operation (leg index / = 2 or 3). For each operator, we have to examine both 
the spins ii and i2 on which it acts. In the following pseudocode, the vertex list positions 
to be filled for the operator at p are vq + /, with VQ = 4p, where the leg pairs / = 0, 2 and 
/ = 1,3 correspond to spins ii and i2, respectively. We use vi and V2 to denote the list 
elements of the last (previous) operation on the two spins. All links, except those across 
the boundary p = L — 1 , 0, can be constructed according to; 
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do /7 = to L — 1 

if (s{p) = 0) skip to next p {28} 
vo = 4p; b = s{p)/2; ii = i{b); i2 = jib); vi = Viast('i); V2 = Viast(«2) 
if (vi ^ — 1) then X(vi) = vq; ^(vq) = vi else Vfirst(ii) = vo endif 
if (v2 7^-1) then X{v2) = vq; X{vq) = V2 else Vfirst('2) = vo + 1 endif 
Vustih) = VQ + 2; Viast(i2) = vo + 3 
enddo 

To connect the links across the time boundary we can use the arrays containing the first 
and last list positions; 

do z = 1 to 

/ = Vfirst(0 {29} 
if (/ ^ -1) then / = yiast(0; ^(/) = ^(0 = / endif 
enddo 

Since there are also positions in X() containing no links, corresponding to fill-in oper- 
ators s{p) = 0, we should have some way to distinguish these. We can set all X{v) to a 
negative number, e.g., X{v) = —1, before constructing the links. Then any v for which 
X(v) > can be used as a starting point to trace a loop. 

Implementation of the operator-loop update. We want to trace all the loops and flip 
each of them with probability 1/2. In order to make sure that no loop is considered 
more than once, we should always start a new loop from a position in X() not previously 
visited. We then need some flag to indicate whether a position has been visited or not. 
Instead of allocating separate storage for such flags, we can actually use the list X{) 
itself — the links stored in it only have to be used once, and as soon as a link at v has 
been used we can set X{y) to a negative number, to indicate that this position should 
not be used again. To start a new loop, we can look for the first position vo for which 
^(vo) > 0. Before traversing the loop, we make the random decision of whether or not to 
flip it. If we do not flip, the loop tracing still has to be carried out, in order to flag all the 
corresponding X (v) as visited. For a flipped loop, we should also change the operators 
in *() to reflect the change diagonal-H-off-diagonal for each visited vertex. We will take 
care of possible flips of free spins in the stored state | a) later. The structure of a code 
for tracing and flipping the loops is; 

do Vo = to 4L - 1 step 2 

if (Z ( vo) < 0) skip to next vo { 30 } 

V = vo 

if (random[0 - 1] < 1 /2) then 

• traverse the loop; for all v in loop, set Z(v) = — 1 
else 

• traverse the loop; for all v in loop, set X(v) = —2 

• flip the loop (change operator types s{p = v/'X) while loop is traversed) 
endif 

enddo 
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Here vq is a tentative starting point for a new loop, and we actually start a new loop 
only if X(vo) is a non-negative number. Note that we only have to consider even starting 
points (the step 2 on the first line of {30}), because of the structure of the vertex list 
and the loops, i.e., two consecutive (even, odd) elements in X{v) always correspond to 
the same loop. To not visit the same loop more than once, we set X{v) = —1 for loops 
that are only visited, and X{v) = —2 for loops that are also flipped. We will need the 
distinction between flipped and only visited loops later, when updating the stored state 
I a) to reflect the flipped loops. 

To trace a loop starting at some vq, we can use the leg index Iq = niod[vo, 4] and move 
to the adjacent leg Iq on the same vertex. With the leg labeling convention in Fig. 58, the 
rule for the leg adjacent to a leg / can be summarized as / = (0, 1,2,3) —>/' = (1,0,3,2). 
This rule can be very efficiently implemented using a bit-level operation on /, as /' 
corresponds to flipping (0^1) the lowest bit of I. Here we just assume that such 
functionality is available in the programming language used [e.g., in Fortran 90 one 
can use the exclusive-or operation of the integer / with 1; /' = ieor(/, 1)] and assign 
Iq = flipbit(/o,0), where is the bit flipped. We actually do not need to extract the leg 
index itself, but can do the corresponding move to the adjacent leg just by manipulating 
its location vq = 4po + ^ in We can use the same bit flip method to find it; 
Vq = flipbit(vo,0). To move to the next vertex, we use the link; vi = X{vq). This 
completes one step of the loop tracing procedure. We next proceed in the same way 
to find the location v\ of to the adjacent leg, and from there we move to the leg V2 linked 
to it. This continues until at some step k we find — vq. We have then completed a full 
loop. In pseudocode form the traversal of a loop and flipping it can be accomplished by 
(where no subscripts or primes are needed on the visited list locations v); 

v = vo 

do {31} 

p = v/4; 5(p)=flipbit(5(p),0); X{v) = -2 

v = flipbit(v,0); X(v) = -2 

v = X{v); if(v = vo)exit 
enddo 

Here the flip of the operator type diagonal-<->off-diagonal is also accomplished using 
the flipbit procedure, which corresponds exactly to the changing the operator code s{p) 
from 2b (diagonal operator) to 2Z? + 1 (off-diagonal operator) or vice versa. In the case 
in code {30} where the loop is not to be flipped, the only difference is that the two 
statements involving p and s{p) in {31} are absent, and we mark all visited locations as 
X(v) — 1 instead if —2 (information which will be used to update the state \ oc)). 

After all loops have been traced, we also have to update the spins in the stored state 
I a). For each site i which has a flipped loop passing through it, the stored spin o{i) 
should be flipped. We should also flip with probability 1/2 any spin that has no loop 
connected to it (i.e., spins on which no operator acts). Both these tasks can be taken care 
of using the information stored in VfirstlO during the construction of the linked vertex list. 
If V = Vfirst(0 — th^t implies that there is no operator acting on spin /. Otherwise, 
we can use the flag —1 or —2 now stored mX{v) to determine whether the loop passing 
through / has been flipped [with Z(v) = —2 for flipped loops]. In pseudocode form: 



Computational Studies of Quantum Spin Systems 



January 18, 2011 



153 



do / = 1 to A'^ 

V = yfirst(0 {32} 

if (v = -l)then 

if (randoin[0-l]< 1/2) o{i) = -o{i) 
else 

if (X(v) = -2) a {i) = -o{i) 
endif 
enddo 

This completes the loop update, and we can repeat the cycle consisting of: (i) perform- 
ing a sweep of diagonal updates, (ii) constructing the linked vertex list, and (iii) trac- 
ing/flipping all the loops and updating \ a) accordingly. These procedures constitute one 
Monte Carlo sweep in the SSE method. 

Equilibration and expansion truncation. We now discuss the expansion cut-off L 
(when the fixed-length scheme is used). We should make sure that L exceeds the max- 
imum expansion order n that will be sampled, so that the truncation in practice is not 
an approximation. Since new diagonal operators are inserted through exchanges with 
the fill-in unit operators, we also want there to be a reasonably large number of these 
s{p) = elements present. But since the memory and CPU time scales with L, we should 
not make L excessively large. One could in principle define an optimal L, for which the 
acceptance rate of the diagonal updates is maximized. In practice, it does not matter 
much, however, whether L is really optimal, since the off-diagonal updates are anyway 
more important in determining the autocorrelation times. We will therefore just make 
sure that some significant fraction of the elements s{p) = 0, so that the sampled n never 
reaches L and that many insertions of operators can be attempted in the diagonal update. 

We can aim for {l+a){n) with, e.g., a — 1/3. Since the relative fluctuations of n 
are small, ~ ^/n, we can do this by letting L^axn and augmenting Sl with additional 
s{p) = elements after each Monte Carlo sweep when a x n > L [setting s{p) = 
for Loid + 1 < p < Lnsw or distributing the zeros randomly. Fig. 62 shows an example 
of how this procedure works in practice, during equilibration of a 16 x 16 system at 
inverse temperature j8 = 16. Initially L was set to N/2 = 128. The adjustment of L 
quickly converges to an acceptable value, and, as shown in the inset, in a subsequent 
long simulation n never comes close to L. 



5.2.4. Estimators for physical observables 

The autocorrelation time in SSE simulations with loop updates is typically very 
short; often just a few Monte Carlo sweeps or less [33, 31]. Calculations of physical 
observables ("measurements") should therefore normally be performed after every (or 
every few) Monte Carlo sweep. As discussed in Sec. 3.2, the autocorrelation times do 
not dictate how often measurements may be done, only with what frequency independent 
data are generated. Autocorrelations, thus, affect the error bars but have no effect on the 
correctness of computed expectation values (i.e., it is not wrong to measure correlated 
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MC sweeps 

FIGURE 62. Evolution of the expansion cut-off L at the initial stage of an equilibration run of a 16 x 16 
Heisenberg system at j3 = 16. The number of operators n in the string after each Monte Carlo sweep is 
also shown, along with the maximum n reached so far The final cut-off after 5000 sweeps was L = 6764. 
The inset shows the distribution of n in a subsequent run of 10^ MC sweeps. 



data, although it may be wasteful if the measurements take a long time to evaluate) 
provided that the total simulation time is much longer than the autocorrelation time. Data 
binning should be used to compute error bars reliably, as also discussed in Sec. 3.2. We 
here discuss several types of observables of interest in SSE calculations. More details 
and derivations of the expressions can be found in Refs. [186, 187, 207]. 

Energy and specific heat. We already discussed some observables in Sec. 5. 1 .3; the 
internal energy (250) and the related expectation value (251) of an individual operator 
in the hamiltonian, as well as the specific heat (252). Given that (H) = — (n)//3, it may 
seem surprising that the specific heat C = ((//^) — {H)^)/T^ is not given just by the 
fluctuation in n, but there is also a term —{n). This is because (H^) = {n{n — l))/j8^, 
which can be easily shown using the same procedures leading to (H) = — (n) //3 in 
Sec. 5. 1 .3. Note that this expression includes the constant 1/4 subtracted from each bond 
operator in (256), but that this constant cancels out in the specific heat expression (252). 
The specific heat in practice becomes difficult to compute reliably (i.e., its statistical 
error is large) at low temperatures (where it becomes small), because it is the difference 
of two large numbers (~ N^j^^). 

Diagonal operators. Expectation values of operators diagonal in the z-component 
basis are also easy to evaluate, using averages over the propagated states; 

(Oz) = jj^{aip)\0,\a{p)) = j^Ozip)- (267) 
^ p=0 ^ p=0 
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It is not necessary to evaluate this average using all p, as successive propagated states 
differ by at most two flipped spins. One can instead use a partial summation over p, 
e.g., Oz(0) + OziN) + Oz{2N) + . . ., to save time. In some cases, however, one may as 
well just propagate O^ip) from /) = Otoj!?=L— Iby first computing 0^(0) and then 
update the value O^ip) according to the spin flips occurring when propagating | a (/?)). 
For example, the fully averaged contribution to the expectation value of the squared 
staggered magnetization from an SSE configuration can be computed as follows: 

do/7 = 0toL-l {33} 
if (mod(s(p),2)=l)then 

b = s{p)/2; am)^-Gmy^ c{j{b)) = -c{j{b)) 
m — m + 2^iO{i{b)) 
endif 

if (s{p) 7^ 0) ms2 = ms2 + rrp' 
enddo 

ms2 = nisi/inN^) 

Here 0, = ±1 is the staggered phase factor for site i and m contains the staggered 
magnetization, which evolves as the operator string is traversed. The change in m when 
two antiparallel spins are flipped can be expressed as 2<j>iO{i{b)), because ^jO{j{b)) = 
<l>iO{i{b)). The average of m squared is accumulated as ms2, in this example only using 
the n propagated states generated by the original index sequence without the fill-in unit 
operators [i.e., skipping the steps where the operator s{p) — 0]. One can also sum over 
all L instances of the states, in which case the result would be divided by L instead of n 
on the last line. As written above, the special case n = is not treated correctly, but the 
code is easy to modify by just using ms2 = in^ /N^ in that case. 

The computational effort of the measurement in {33} scales as ^N, i.e., the same as 
the SSE sampling algorithm. To compute the Fourier transform of the spin correlations 
(the static structure factor) at some arbitrary momentum [noting that corresponds to 
S{k), with K denoting the staggered wave-vector, e.g., K = {k,k) in two dimensions], 
e.g., for use in the correlation-length definition (70) or (73), one can use a similar 
procedure for the real and imaginary parts of m{q), 

m(q) = J^5^cos(q-r)-KjJ^5^sin(q-r), (268) 

r r 

with in {33} replaced by the corresponding sine and cosine factors [and explicitly 
considering the changes from both the flipped spins at i{b) and j{b)]. The structure 
factor is then accumulated according to S(q) = ^(q) -|-Re^{m(q)} -|-Im^{m(q)}. 
For computing the full correlation function, 

Q(r,y) = (5^5^.), (269) 

averaged over all ij [or the Fourier transforms for all q, which can be written as 
(m(q)m(— q))], it would not be practical to update all these oc N different functions after 
each spin flip as in {33}. It is then better to compute all the correlations from scratch 
in, e.g., every N:th propagated state. This still leads to a rather expensive scaling oc 
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of the effort to measure all the correlations — a factor A'^ worse than the sampling. One 
should then judge whether it is better to compute only a subset of the correlations [e.g., 
along some lines in the plane] or just measure them less frequently, in order for 
the measurements not to completely dominate the calculation. 

Susceptibilities. Another important class of observables are generalized susceptibil- 
ities, i.e., linear response functions of the form 

Xab— — — , 

where b is the prefactor in a field term bB added to the hamiltonian and A is the operator 
whose response to this perturbation we want to compute. For example, we may be 
interested in the response Xij at site j when a magnetic field acts only on site in which 
case A = S^j and B = S^. Such a susceptibility is given by the Kubo formula [186] 

P 

Xab^ I dT{AiT)BiO))-P{A){B), (271) 



where A(t) = e~^^Ae^^. If both A and B are diagonal, this Kubo integral can be 
evaluated in SSE simulations using the generic formula 



/ 





"..(.(.j.fo)) . (-^ g.,,,) (|;>,„) ) . (^^LAWBw) . 

(272) 

The sums over A (p) and B{p) can be computed using code similar to {33}. 

We are often interested in susceptibilities for which A = B, e.g., the magnetic response 
X{(1) at wave- vector q, in which case A —B — m(q), with m(q) the Fourier transform of 
the spin configuration given by (268). Since the exact value of xi<l) must be real-valued, 
one only has to compute the real part of (272). If all local response functions Xij = 
Xi^ij) are computed in a simulation, the Fourier transform xi^l) can later be evaluated 
using these. These response functions are directly accessible in NMR experiments; see 
Ref. [208] for an example. 

Note that if A and B commute with the hamiltonian, then A(t) = A(0) = A [and 
A{p) = A and B{p) = B are independent of p in (272)] and the susceptibility reduces to 
the classical expression Xab — j8 {{AB) — (A) {B)). In practice, for the Heisenberg model 
the only case where this form applies is the uniform magnetic susceptibility; 

X = X{0) = ^{M^), M,= f^Sl (273) 

Here the term j8 (Mj-)^ in (271) vanishes, since H does not include a magnetic field. 

If the operators A and B in (271) are not diagonal, the susceptibility is more com- 
plicated in general. Here we will only discuss an important special case, in which the 



Computational Studies of Quantum Spin Systems 



January 18, 2011 



157 



estimator actually is very simple. If the operators involved are two terms of the hamil- 
tonian, as defined in Eqs. (255) and (256) and here referred to just as Hj^ and Hb for 
any two instances of those (any two diagonal or off-diagonal bond operators), then the 
susceptibility measurement just involves counting the numbers N{A) and N{B) of those 
operators in the sampled SSE operator sequences; 

Xh^Hs = ^ [{N{A)N{B)) - 5a,b{N{A))] . {21 A) 

The most important example of this type is the current susceptibility Xhh'' where the 
spin current operator Ix (here in the lattice x direction, for definiteness) is defined by 

N 

h = Y,[S-{ri)S+{ri + x)-S+{ri)S-{ri + x)], (275) 

i=l 

where we assume, for simplicity, a hamiltonian with only nearest-neighbor interactions. 
For longer-range interactions, there would be corresponding current terms between the 
same site pairs as in the hamiltonian. Although (275) is not exactly a sum of the off- 
diagonal operators (256) used in the SSE sampling, those operators can be written as 
sums of two parts, 

H2,b = Hi^ + fJh' ^b=^\b)^J{by ^b=^J{h)K{by (276) 

where the site pairs [i{b)J{h)] in each case are assumed to be ordered in such a way 
that the -I- and — terms transport one unit of spin in the positive and negative direction 
Tj — Ti of the bond, respectively. A key point here is that, although the SSE configurations 
contain the full off-diagonal operators H2^b^ only the -|- or — part in (276) contributes in 
each instance (with the other part destroying the state it acts on). One can therefore use 
(274) to evaluate local current-current response functions of the form 

H,b, = JdT{Ib,{T)Ib,{0)), {111) 


where 4 here denotes the current operator at bond b. Using (274) one obtains 

hb,b, = {[N^{h)-N-{b,)][N+{b2)-N-{b2)])-5b,b,{N+{b,)-N-{b,)), (278) 

where N^{b) and N~ (b) denote the number of operators in the SSE operator sequence 
transporting spin in the positive and negative direction, respectively, across the bond 
b. The meaning of this becomes clear when looking at a graphical representation of an 
SSE configuration, such as Fig. 55. There one just has to sum the number of off-diagonal 
events in which an f spin is moved to the right (the positive direction) minus the number 
of left-moving events. The term N^{b) —N^{b) is zero, unless the winding number in 
the lattice direction defined by bond b is not zero, which can be seen in the illustration of 
winding numbers in Fig. (50). It can be noted that the form (278) of the current response 
is identical in SSE and path integral (world line) methods. 
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Spin stiffness. We discussed the spin stiffness in the context of the classical XY 
model in Sec. 3.5. The basic definitions of this quantity at T = 0, Eq. (83), and at 
r > 0, Eq. (86), are identical for quantum spin systems (XY or Heisenberg models) 
as well as bosonic systems more broadly (where the analogous quantity is the superfluid 
phase stiffness, which is proportional to the superfluid density). Deriving Monte Carlo 
estimators for SSE or world line methods, one finds that the term j8 (/^) in (97), which 
is a classical response function, should be replaced by the corresponding quantum 
mechanical Kubo formula, giving the spin stiffness in the form 

Ps = ^(^-) - ^ / dTiUtMO)). (279) 


where we assume that the phase twist is imposed in the lattice x direction. The Kubo 
integral here consists of a sum of bond-current response functions of the form (277), 
and using the result (278) one finds that the energy term (Hx) is exactly canceled by the 
5-function terms coming from (278). This leads to a very simple expression for the spin 
stiffness in terms of the SSE (or world line) configurations: 

Ps = ^{{N^-N-)^). (280) 

Here and A^"" denote the total number of operators transporting spin in the positive 
and negative x direction, respectively. For a spatially isotropic system in two or three di- 
mensions, this can of course be averaged over the dimensions, whereas for an anisotropic 
system the stiffness depends on the direction. 

The spin stiffness is often expressed as the fluctuation of the winding number (here in 
the jc direction, with analogous expressions for other directions), 

Wx = Un+-N-), (281) 

i.e., the size normalized current (Wx — 0, ±1, ±2, . . .). For a J-dimensional system with 
N = L'^ the stiffness is just p, = (W^) /p for any direction a, but for other shapes (e.g., 
N = LxXLy with Lx ^ Ly) the lengths also enter and (280) looks simpler. 

In derivations of the spin stiffness, it is normally for simplicity assumed that the spins 
order in the XY plane (in spin space), whereas in a Heisenberg model the direction of 
the order parameter in spin space is not restricted to this plane (the symmetry is not 
broken) in an SSE simulation of a finite system. A simple rotational averaging argument 
shows that the expression (280) should be multiplied by 3/2 in order to obtain the correct 
stiffness of a symmetry-broken state (i.e., when considering the thermodynamic limit). 
The expression (280) applies also to the quantum XY model [209], where the result 
should not be multiplied by 3/2. 

A winding number estimator completely analogous to (280) was originally derived in 
the context of the superfluid phase stiffness of bosons in continuous space [210]. The 
role of global cyclical permutations of particles in superfluidity dates back to Feynman's 
pioneering work on path integrals [175]. 
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FIGURE 63. Example of clusters formed by space-time loops passing through a propagated state \ ci{p)) 
(for arbitrary fixed p). Here there are six such clusters, labeled 1 , . . . , 6. Open and solid circles correspond 
to t and l spins, respectively, in \a{p)). When a loop is flipped, all spins in the corresponding cluster are 
also flipped, as indicated here with two different configurations corresponding to the two states of cluster 
1 (the sites enclosed by larger circles). Note that the spins within each cluster are always in one of the two 
staggered configurations. 

5.2.5. Improved estimators 

The operator-loop update in the SSE method (as well as loop updates more broadly 
[31, 191]) is an example of a cluster update. Such non-local updates were first developed 
for classical Monte Carlo simulations of the Ising model [118]. One aspect of cluster 
methods is that it is possible to take averages of estimators for physical quantities over 
all orientations of the clusters, because the configuration weight does not change upon 
flipping a cluster. This is immediately clear in SSE simulations of 5 = 1/2 Heisenberg 
models, because the weight (264) only depends on the number of operators n in the 
sequence, which does not change when a loop is flipped. If the number of clusters (here 
operator-loops) is m, then the total number of equal-weight configurations is 2"\ and 
the average over all of these configurations can provide a much less noisy estimator 
than one depending on just a single configuration. The crucial point here is that, for 
many important quantities, this average can be computed analytically, and the resulting 
improved estimator is of a simple form that can be evaluated rapidly in simulations. Here 
we only discuss the rather simple cases of the the static (equal-time) structure factor and 
the uniform magnetic susceptibility. For improved estimators for some other quantities, 
see the review article by Evertz [31]. 

Consider a propagated state \a{p)), e.g., the stored |ct(0)). In the linked vertex 
representation of the SSE configuration, illustrated in Fig. 61, there is a loop passing 
through each of the spins in this state (with spins without operators acting on them also 
considered as individual loops). The same loop can go through many spins in |a(0)), and 
all spins belonging to the same loop form a cluster, in the sense that if the loop is flipped 
all the spins in the clusters are flipped simultaneously. Note that the loops are objects 
in space-time, while the clusters discussed here are defined on a cut at fixed time (here 
propagation index p). A cluster can consist of several parts that appear disconnected in 
space, since such pieces can be connected in the larger space-time volume where the 
loops exists. An example of clusters on a 2D lattice is shown in Fig. 63 

Since we are dealing with a bipartite lattice, and because the loop structure is such 
that the spin on a vertical loop segment (referring to pictures such as Fig. 61) changes 
each time one changes direction when moving along a loop, the spins within a cluster 
formed at a given state \ cc{p)) always have a staggered structure. The staggered magne- 
tization ms{j) of a cluster labeled j, with j — 1, . . . ,C, where C is the total number of 
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clusters, is then nisij) = ±nj, where nj is the number of spins in cluster j. For a given 
configuration, the total staggered magnetization Ms = Lj=i"^i (j)- When averaging the 
square of this sum over all the different realizations of cluster orientations, the cross 
terms {ms{i)ms{j)) = (for / ^ f). One is then left with just the / = j contributions, and 
the staggered structure factor is simply given by 



Structure factors at other wave-vectors q are only marginally more complicated, de- 
manding in place of the cluster sizes nj a summation over each cluster of the phases 
0rexp(/q ■ r), where r refers to sites on a given cluster and (^r = ±1 is the staggered 
phase factor, which takes care of the staggered spin structure within the clusters (and the 
denominator 4 corrects for the fact that the spin values are ±1/2). One can here also use 
the fact that the true structure factor must be real-valued for any q. 

In principle, equal-time correlation functions such as the structure factor can also 
be averaged (fully or partially) over the propagation index p, as in Eq. (267). This, 
however, requires more work for the improved estimator than in code {33} for the simple 
estimator, because it takes some book keeping during the loop update to construct the 
clusters for several fixed p, and doing so may not always pay off. Without this averaging, 
however, a simple p-averaged estimator, such as the one implemented in code {33}, may 
actually give better results at low-temperatures, where the gain due to averaging can be 
very significant. The case q = is special in this regard, because this corresponds to the 
total squared magnetization, which is a conserved quantity (i.e., independent on the SSE 
propagation index p), and no further averaging over p can then be done to improve the 
statistics further. The optimal estimator for the uniform susceptibility (273) is therefore 



Susceptibilities at other wave- vectors involve the full space-time loop structure, not just 
the clusters (cut through the loops) formed at a fixed state. For example, the staggered 
susceptibility is given by the sum of the squares of all the loop sizes [31]. 



QMC programs should always be verified by comparing results for small systems 
with exact diagonalization data. When correctly implemented, the SSE method should 
be exact, which means that the deviation of a computed quantity from its true value 
should be purely statistical, due to the finite number of sampled configurations. We 
have discussed how to quantify the statistical fluctuations in terms of "error bars" in 
Sec. 3.2. Deviations beyond the error bars are due either to programming errors of flaws 
in the random number generator used. While most programming errors would lead to 
obviously wrong results, there are also possible subtle errors that may only lead to 




(282) 




(283) 



5.2.6. Program verification 
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FIGURE 64. The uniform susceptibility of a 4 x 4 Heisenberg system versus the temperature. The curve 
is the exact result from a full diagonalization. The points are SSE results based on lO'" updating sweeps 
for each T . The error bars cannot be resolved on this scale. The inset shows the deviation of % from the 
exact result with error bars, for both simple and improved (loop) estimators. The data points have been 
slightly shifted off their actual T values in order for the error bars of the two estimators not to overlap. 



minute deviations from exact results for small lattices. Such systematical errors may 
grow with the system size, with potentially grave consequences. The same may be said 
about random number generators; bias effects due to imperfect random numbers may 
be very small for small lattices, but can become more significant for larger systems. It 
is therefore important to make comparisons with exact diagonalization results based on 
very long SSE runs, to detect possible small deviations. While it is impossible to strictly 
prove that a program is correct in all respects, agreement to within very small error bars 
with exact data makes this very likely. 

Test results. We now discuss some test results for ID and 2D Heisenberg models. 
The random number generator used in these calculations (and most other SSE calcula- 
tions discussed in these lecture notes) was a simple 64-bit linear congruential generator 
with multiplier 2862933555777941757 [211] and addition of 1013904243. 

Fig. 64 shows the susceptibility per spin of a 4 x 4 system. For each temperature 
10^*^ updating sweeps and measurements were carried out, with the data subdivided into 
100 bins for computing the statistical errors. For T = 7/10, this required approximately 
100 CPU hours on a mid-range PC workstation. The error bar (defined as one standard 
deviation of the estimated fluctuation of the average, as discussed in Sec. 3.2) is typically 
fti 10^^ when the standard estimator for % is used, and even smaller with the improved 
estimator (283). The relative error (the error bar divided by ;^) is ^ 5 x 10~^ for T /J ^ 1. 
The deviations of the averages are completely consistent with the size of the error bars. 
Recall that for correctly computed statistical errors, one should expect about 2/3 of the 
data points to bracket the true data within one error bar. The gain in precision with the 
improved estimator can be much larger than in Fig. 64 for larger systems. 

While it is essential to confirm the unbiased nature of the SSE calculations on small 
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FIGURE 65. SSE results for the internal energy per spin at several inverse temperatures /3 — 2"' for 
Heisenberg chains of length = 1024 and 4096 compared with the corresponding exact (Bethe ansatz) 
ground state energies. The SSE results were obtained using 1 — 2 x 10^ updating sweeps for each m, which 
required several hundred CPU hours for the largest A^, )3 . 



lattices, one might still worry about potential problems with the random number gen- 
erator for larger systems. In one dimension, we can also test SSE calculations against 
exact Bethe ansatz results for very long chains in the ground state. To approach the 
ground state in SSE calculations, it is convenient to use inverse temperatures of the form 
j8 = 2" and go to sufficiently large m for calculated quantities to become m independent. 
This approach is illustrated in Fig. 65, which shows the internal energy, E = (H), com- 
puted using the simple expansion-order estimator (250) for chains of length = 1024 
and 4096. Numerically computed Bethe ansatz results for these chain lengths are listed 
in [212]. The agreement between the calculations is perfect within error bars, with no 
detectable temperature dependence for the last three or four points for each A'^. This good 
agreement, to within relative statistical errors as low as 5 x 10^^, shows quite convinc- 
ingly that the calculation is for practical purposes completely unbiased and that ground 
states of even quite large systems can be studied. 



5.3. Applications of SSE to ID and 2D systems 

We next discuss several illustrative results for ID and 2D Heisenberg models obtained 
with the SSE method. Results showing clearly the logarithmic corrections to critical 
behavior in the ID chain are presented in 5.3.1. The qualitatively different ground states 
of n-chain ladder systems with even and odd n are discussed in 5.3.2. The sublattice 
magnetization and some other low-energy parameters associated with the Neel state are 
computed for the standard 2D Heisenberg model in 5.3.3, and in 5.3.4 dimerization is 
introduced in this model, to drive a quantum phase transition into a plain non-magnetic 
state. Quantum critical finite-size scaling behaviors of various quantities are analyzed. 
The more complex case of a Neel-VBS transition is investigated in 5.3.5, using two 
types of J-Q models that exhibit, respectively, continuous and first-order transitions. 
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FIGURE 66. Spin correlation function at distances r = 2"' for chains of length = 1024 and 4096. 
The error bars are smaller than the symbols. The solid curve is of the expected form Ar^ ' ln(r/ro)'/^, 
with A = 0.21 and ro = 0.08. The dashed curve shows the form °^ r^^ for comparison. These results 
were obtained using inverse temperatures /3 = 2'^ and 2'^* for — 1024 and 4096, respectively, which is 
sufficient for T — > convergence. 



5.3.1. The Heisenberg chain 

Spin correlations at T = 0. In Sec. 4.3.1 we discussed Lanczos results for the spin 
correlation function of the Heisenberg chain and saw some hints of the expected loga- 
rithmic correction to the ~ 1/r critical behavior (Fig. 34). The system sizes accessible 
with the Lanczos method are not sufficient for studying these scaling corrections quanti- 
tatively, however. As we saw above, with the SSE method unbiased studies of the ground 
state is possible for chains of several thousand spins (with careful checks of the conver- 
gence to the r — 7- limit). Fig. 66 shows the spin correlations for = 1024 and 4096 
at distances r = 2"\ graphed on a log-log scale. To save time, only the correlations at 
these distances were computed [for a scaling N\og{N) of the time to carry out spatially 
averaged measurements]. The results for the two system sizes coincide closely for r up 
to 2^, indicating convergence to the infinite size values up to this distance for A'^ = 1024 
(and therefore up to r ^ 2^ for = 4096, since the convergence behavior should scale 
approximately linearly with A^^). 

The expected form \C{r)\ = A\n^/^{r/ro)r-^ [57, 58, 59] is very well reproduced up 
to r = 2^ for A'^ = 4096. The parameters A and ro obtained from a fit are listed in the 
figure caption. If one leaves the exponent o = 1/2 of the logarithm as a free parameter 
to be obtained from the data based on a fit, the exponent indeed comes out close to 0.5, 
but with a rather large error bar of, roughly, ±0. 1 . To really investigate the exponent 
carefully, one should further increase the chain length (which is possible in principle). 

If one did not know about the existence of a log correction and tried to extract the 
form of the spin correlations on the basis of numerical calculations alone, one might at 
first sight conclude that the decay is 1/r" with Cf 0.85, based on the data in Fig. 66. 
There are, however, small but significant deviations from a pure power-law, which can 
be detected only if the relative statistical errors are sufficiently small. In the data shown 
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FIGURE 67. The circles show SSE results for the uniform susceptibility of the Heisenberg chain at 
temperatures T of the form 2^"', m — 0, 1,2, ... for different power-of-2 chain lengths up to = 4096. 
Error bars are much smaller than the circles. The exact = 16 result is shown as the solid curve and the 
dashed curve shows the low-T form (284). The asymptotic T — > value, ]% = I/tt^, is indicated by the 
short horizontal line segment on the left side of the graph. 



in the figure, the error bars are too small to discern. While the absolute errors are small, 
typically ^ 2 x 10^^ for = 4096 (based on approximately 10^ Monte Carlo sweeps at 
/3 = l}'^ and using an improved estimator), the relative error is much larger, about 0.002 
for r = 2^, growing to 0.02 for the longest distance r = 2^^. 

Low -temperature magnetic susceptibility. Logarithmic corrections are also impor- 
tant in the Heisenberg chain at T > 0. The most prominent example is the uniform mag- 
netic susceptibility, for which a renormalization-group study of the low-energy field- 
theory predicted the form [7, 213] 

where the T — > value x{^) = 1 /2;rc, where c = Jk /2 h the spinon velocity, agrees 
with the Bethe ansatz solution of the ground state of the Heisenberg chain. The Bethe 
ansatz can also be extended to T > 0. Good agreement with the asymptotic expression 
(284), with the parameter Tq = 1.1 (adjusted to fit the Bethe ansatz results), was found 
at low temperatures (T/J < 0.02) [7]. Note that the logarithmic correction in (284) 
implies a very slow convergence to the T = limit, and it would therefore be difficult 
to extrapolate numerical results without knowing about the presence of this kind of 
correction. A higher-order logarithmic correction to (284) is also known [213], but here 
we will just consider the leading-order correction. 

Fig. 67 shows SSE results for the susceptibility obtained using chain lengths of the 
form N = 2" and temperatures T /J = 2^"\ Due to the excitation gap in chains with 
finite A'^ (discussed in Sec. 4.3.1), x decays exponentially to zero below a temperature 
r ~ l/N. The log-lin scale used in the figure makes these finite-size effects very clear 
and also shows how the results converge rapidly to the thermodynamic limit form once 



Computational Studies of Quantum Spin Systems 



Januai-y 18,2011 



165 



the finite-size gap becomes smaller than T . The agreement with the form (284), using 
the value Tq — 1.1 determined in Ref. [7], is very good for T/J below 0.05 (in fact, for 
some unknown reason, the agreement appears to be even better than in Ref. [7]). 

The main utility of the form (284), beyond its role in establishing the correctness of 
the low-energy field-theory, is that it is valid not only for the simple Heisenberg chain 
considered here, but for any ID spin system (at sufficiently low temperatures) which 
is in the same phase as the Heisenberg chain. This includes the frustrated J1-J2 chain 
discussed in Sec. 4.3.2, for J2/J1 less than the dimerization point (ii/^i)^ ~ 0.241. 
Exactly at the dimerization point the leading logarithmic correction should vanish [164] 
(but there are still other, higher-order corrections), while above the transition point a 
spin gap opens and X ^ven for N = 00, The asymptotic form (284) should be valid 
throughout the QLRO(;r) phase of the chain including longer-range interactions [the 
model discussed in Sec. 4.3.3], as well as in many other systems. The parameters c 
and Tq depend on the model parameters and fitting of numerical data provides a way to 
extract, in particular, the velocity c. As we will see in the next section, ladder systems 
consisting of an odd number of chains also are in the same critical phase as the single 
chain, and (284) applies also there. 



5.3.2. Ladder systems 

A ladder lattice consists of a fixed number Ly of coupled ID chains (often referred to 
as the legs of the ladder) with the chain length Lx taken to infinity (or, in practice, Lx/Ly 
sufficiently large to give results converged to this limit). Strongly-correlated quantum 
systems in this geometry [9, 214, 215] play an important role as a means of interpolating 
between one and two dimensions. Many materials exhibit a structure of weakly coupled 
ladders with small Ly, which motivates studies of Ly = 2, 3, etc. On a more fundamental 
level, it is very interesting to see how the special properties of ID systems evolve with 
increasing Ly and approach the 2D limit [216]. Here we will investigate some of the 
most essential properties of Heisenberg ladders with Ly up to 6. 

The most important aspect of the physics of S = 1/2 Heisenberg ladders is that they 
have completely different low-energy properties for even and odd Ly (the number of sites 
on each rung of the ladder). For even Ly there is always a spin gap (which vanishes as 
Ly — )■ 00), while odd-Ly systems are gapless and have properties similar to a single ID 
chain (below an energy or temperature scale which vanishes as Ly 00), This difference 
can be understood roughly based on a simple picture of valence bonds [217], illustrated 
in Fig. 68. If there is no coupling between the rungs (Jx = 0, 7y > 0), the ground state of 
the 2-leg ladder is just the product of rung singlets — a unique (non-degenerate) state with 
a gap Jy to the first excited state (in which one of the singlets is promoted to a triplet). 
For the 3-leg ladder, on the other hand, the ground state of an individual rung is two-fold 
degenerate, with 5^ = ±1/2. When coupling the rungs (Jx > 0), this degeneracy is lifted, 
but, for any ratio Jx/Jy, there is a remnant of the degeneracy, which can be understood 
as arising from Lx interacting 5=1/2 degrees of freedom. This leads to low-energy 
properties similar to those of a single 5=1/2 chain. In the 2-leg ladder, the singlet- 
triplet excitation gap remains non-zero for any Jx/Jy, and the low-energy excitations 
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FIGURE 68. Illustration of the ground states of the 2-leg and 3-leg Heisenberg ladders at inter-rung 
coupling 7v = 0. The vertical bars indicate valence bonds (singlets), and the circles show unpaired ^ and 
], spins. The ground state of the 2-leg ladder is a unique singlet-product, whereas in the 3-leg case each 
rung-state is two-fold degenerate, with 5ju„g = ±1/2. Each of these states is a symmetric combination of 
the two states with the unpaired spin on the upper and lower chain. Here a random configuration of the 
location and of the unpaired spin is shown for each rung. 



above this gap form a band of propagating rung triplets [218]. The spin correlations are 
exponentially decaying with distance, with the correlation length diverging as Ly — > oo. 

In the valence bond basis (discussed in Sec. 2.2), the 2-leg ladder with > is still 
dominated by short bonds (one can say that it is an RVB spin liquid, but, due to the 
constraints of the ladder geometry, it is also appropriate to call it a VBS), whereas the 
critical state of the 3-leg ladder requires bond probabilities that decay with the bond 
length as a power-law [217] (and such a state may be called a critical RVB state). The 
same pictures remain valid also for the ground states of ladders of larger width, but the 
energy scale associated with ladder behavior vanishes as Ly — )• oo, with cross-overs into 
2D behavior at higher energies. The relevant energy scale is the spin gap for even Ly and 
the spin stiffness for odd L^ — as we discussed in connection with the KT transition in 
Sec. 3.5, power-law correlations can be sufficient to sustain an non-zero spin stiffness, 
and this is the case with the ~ 1/r correlations in the single chain and odd-leg ladders. 

Here we will discuss results only for spatially isotropic couplings; = Jy. For even 
Ly, the behavior is similar for periodic and open boundary conditions in the y direction 
(and for Ly = 2 periodic boundaries only corresponds to doubling Jy), while for odd Ly 
the gapless nature of the system for all Jx/Jy applies only for open y boundaries. In the 
"tube" geometry (periodic y boundaries), odd Ly leads to frustration, and the behavior 
is then much more complex, with several possible ground state phases [219]. In the 
SSE calculations discussed below, open y boundaries were used in all cases, while the x 
boundaries were periodic. 

Spin correlations. In the preceding section we studied the spin correlations of the 
single chain and confirmed the presence of a logarithmic correction to the oc 1 /r critical 
behavior. We now test this behavior for odd Ly> I, and also look at the exponentially 
decaying correlations for even Ly. 

With open boundary conditions in the y direction, the correlator (S/ • Sj) is not a 
function just of the separation r,y between the two spins, but depends on the y coordinates 
of both spins. Here we use the maximal reflection- symmetric (in the y-direction) distance 
between two spins, i.e., for a given spin at {x,y), with x = 1, . . . ,Lx and y = 1, . . . ,Ly, 
the correlation is computed with the spin at (x + Lx/2,Ly —y-\-\), with the periodic 
boundary condition taken into account in the x direction. Averages are then taken over 
all {x,y). This is just one convenient choice, and it is not important exactly how the 
separation in the y direction is treated, because the correlations anyway depend only 
weakly on it when the x separation is large. 
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FIGURE 69. Spin correlation function at the longest x distance (with a certain averaging over y 
separations, as discussed in the text) in x Ly Heisenberg ladders with different L,, as a function of 
the length L^. The dashed line has the form C{x) °^ 1 /x. 

Fig. 69 shows the long-distance correlation function versus x = Lx/2 for ladders of 
width Ly = 1—6. The qualitative difference between even and odd Ly is clear, and in the 
case of odd > I the behavior for large is very similar to the = I system, with 
deviations from the form C{x) <^ l/x that can be explained by a logarithmic correction 
(which we will not analyze in more detail here). For even L^^, there is an exponential 
decay for large x, but for = 6 a different short-distance behavior can already be seen 
emerging. For very large Ly and jc up to ~ Ly, one should expect the correlation function 
to be of the 2D form, for both even and odd Ly, i.e., C{x) should approach the value 
~ 0.095 of the 2D squared staggered magnetization (illustrated with data in Fig. 5). 
For X ^ Ly, there should be a cross-over into either an exponential fall-off (for even Ly) 
or the 1 /x form with a logarithmic correction (for odd Ly). In Fig. 69 one can see that 
the correlation at jc = 8 is almost the same for Ly = 4 — 6, but for larger x there is still 
an increase with Ly,. To clearly observe 2D behavior followed by cross-overs into either 
even- or odd-leg ladder asymptotic behavior, much larger Ly would be required. 

Susceptibility. The temperature dependence of the susceptibility provides a conve- 
nient way to extract the spin gap A for even-L^ ladders, and it is also interesting to 
investigate the applicability of the expected asymptotic form (284) for odd Ly. Fig. 70 
shows SSE results for = 1 — 6, with Lj^ sufficiently large for each temperature to con- 
verge accurately to the thermodynamic limit. For the lowest temperatures Lx — 1024 was 
used. For fixed T, convergence as a function of Ly can be seen for T /J > 0.2, where the 
results for Ly = 4,5, and 6 are almost the same. The converged curve corresponds to the 
2D limit (for which results for large Lx L lattices are shown in Fig. 73). The qualitative 
difference between ladders of even and odd width is clear at low temperatures, with x 
for even L^ decaying to zero exponentially below an Ly dependent temperature. For odd 
Ly, it can be noted that the T — )■ value decreases with Ly. One can argue that the T — )■ 
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FIGURE 70. Temperature dependence of the susceptibility of Heisenberg ladders of different width Ly. 
The length was sufficiently large for each T to represent accurately the — > °° limit. The right panel 
shows the low-temperature behavior for Ly = 2 — 6 on a more detailed scale. The statistical errors are not 
discernible, thanks to the use of the improved susceptibility estimator (283). 



susceptibility per rung should be roughly independent of Ly for small Ly [216], and, thus, 
the susceptibility per spin should scale approximately as \/Ly. We will not do any fitting 
to the asymptotic form (284) here, because it is valid only for very low temperatures for 
Ly>3 (and a higher-order logarithmic correction may be necessary unless extremely 
low temperatures are used [216]). Note that the ;^(0) ~ 1/Ly form is not inconsistent 
with the 2D behavior XiD (0) > 0, because the ladder behavior (for both even and off Ly) 
applies only below some T*{Ly) which goes to zero when Ly — )■ oo. 

Let us now analyze the Ly dependence of the spin gap A for the even-width ladders. 
One expects roughly an exponential low-temperature susceptibility, x ~ exp(— A/T), 
due to the fact that the ground state is a singlet (non-magnetic, with (M|) = 0) and the 
first excited state is a triplet (magnetic, with (M|) = 2/3). This behavior is, however, 
modified by the fact that there is a whole continuum of magnetic states (for L^ — )■ °°) 
above A. A low-temperature form of the uniform susceptibility of the 2-leg ladder, 

X{T) = -^e-V^ (285) 

has been obtained by analyzing the limit of weak inter-rung couplings perturbatively 
[218]. This form can be expected to hold asymptotically for T — )■ for any Jy/Jx and 
even Ly (for — ?• 0°). The gap can be extracted from numerical x{T) data by considering 
the logarithm of \/Tx in (285), giving 

-rin(v^;i:) =A-nn(a). (286) 

Thus, the intercept of a line fitted to —T\n{\/Tx) versus T equals the gap A. Fig. 71 
shows data analyzed in this way. In all cases, a linear behavior obtains at low tempera- 
tures, and line fits give the gaps listed in the figure. A field theoretical treatment shows 
that the gap for a spin-5 ladder system should decrease with Ly as A ~ exp(— ;r5Ly) 
[220], which is not in good agreement with the results obtained here (and less precise 
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FIGURE 71. The susceptibility of even-width ladders analyzed according to Eq. (286). Extrapolations 
in the linear low-T regime, shown as lines extending through the points used in line fits, give the spin gaps 
indicated in the figure. The statistical precision is roughly the number of digits shown. 



earlier calculations [216]). Calculations for larger Ly would have to be carried out (which 
can certainly be done) to investigate this issue in more detail. 

It would also be interesting to study the odd-leg ladders for larger Ly and lower 
temperatures than what has been done until now, to extract the L,, dependence of the 
parameters c and Tq in (284). An alternative approach is to map the low-energy properties 
of a ladder onto a single chain with longer-range interactions [221], which arise because 
the 5=1/2 degrees of freedom of the isolated rungs are not completely localized to 
individual rungs once they are coupled. The localization length increases with Ly, and 
eventually, for Ly — )■ oo, an effective model with sufficiently long-ranged interactions to 
produce Neel order (as discussed in Sec. 4.3.3) should obtain. The mapping procedure 
has not yet been tested for ladders with large Ly, however, and one should therefore also 
study the full ladder systems at lower temperatures and for larger Ly. 

5.3.3. Long-range order in two dimensions 

We already discussed the nature of the antiferromagnetically ordered (Neel) ground 
state of the 2D Heisenberg model in Sees. 2.1 and 4.4.2. Extrapolation of the best 
currently available QMC results [50] for the squared sublattice magnetization, shown in 
Fig. 5 gives m., = 0.30743(1) for the infinite system, where (1) denotes the statistical 
error in the preceding digit. This result deviates only by about 1% from the linear 
spin wave result m., = 0.3034. Higher-order spin wave calculations [40, 41, 42] give 
nis ~ 0.3070. As discussed in Sec. 2.1, this good agreement with the actual value can be 
traced to the fact that the quantum fluctuations are not that strong, reducing the sublattice 
magnetization by only about 40% from the classical value. Other important ground state 
quantities, such as the spin stiffness [222], the transverse magnetic susceptibility, and 
the spin wave velocity, are in similar good agreement with spin wave theory (but we will 
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not make any detailed comparisons of values here). 

After discussing some general data fitting issues, we will here compute the energy 
and the spin stiffness using finite-size extrapolations of T — > converged SSE results 
(with proper convergence confirmed using checks such as those shown in Fig. 65 for 
all quantities of interest). In addition to the L x L lattices normally used, we will also 
consider rectangular LxXLy lattices with Lx — 2Ly. This enables a consistency check of 
the extrapolations. We also discuss the finite-temperature susceptibility and extrapolate 
it to zero temperature, using Lx L lattices sufficiently large to completely eliminate 
finite-effects for the range of temperatures considered. We finally discuss the divergence 
of the correlation length as T ^ 0, which is another important manifestation of the Neel 
ordered ground state [5]. 

Extrapolations of the ground state parameters of the 2D Heisenberg model have been 
presented in a large number of papers previously, e.g., Refs. [223, 224, 205, 225, 226]. 
For most practical purposes, the precision already achieved is quite sufficient. Since the 
model is one of the most important prototypical systems in quantum magnetism, it is, 
however, useful to continue to establish more precise benchmark calculations (which 
can be useful, e.g., for testing other methods [94]). Some of the results presented below, 
in particular the ground state energy, represent the most precise calculations to date. 
In addition to the extrapolated AT = oo, r = values of the various physical quantities, 
their finite-size and temperature corrections are also of interest, because they have been 
predicted in great detail based on field-theoretical methods [227], and numerical test are 
useful to establish the range of validity of the low-energy theories. The corrections are 
statistically much noisier and require very long simulations to establish precisely. The 
discussion here is only intended to give a flavor of what can be done. 

Data fitting issues. When extrapolating finite-size result to the thermodynamic limit, 
it is useful to know the expected form of the size corrections based on analytical calcu- 
lations. Away from a critical point, the size corrections for systems of dimensionality 
d > I normally take the form of a polynomial in the inverse system length 1/L, but 
some times the leading correction is ex l/L" with a an integer larger than 1. As dis- 
cussed in Sec. 2.4.1, the leading size correction to the sublattice magnetization of the 
2D Heisenberg model can be obtained from the spin wave theory of the Neel state and is 

1/L. This result is also a more general consequence of the fact that the order parameter 
is a vector, as discussed in Sec. 3.3. The ground state energy E/N per site has a leading 
correction oc and thanks to this high power it is relatively easy to obtain the energy 
to high precision even based on rather small lattices [205]. The leading correction to the 
spin stiffness is 1/L [228]. Even if the leading power is not known, one can normally 
find it empirically based on data, provided that the numerical precision is sufficiently 
high. When fitting data to a polynomial one may find that the coefficient of the linear 
term (and possibly higher-order terms) is very small, which then makes it likely that it 
actually should be exactly zero. It is best to use a fitting program which allows one to 
specify exactly what powers in 1/L should be included. 

For a given set of data (here a quantity computed for a set of lattice sizes, but we will 
use the same technique also for fitting temperature dependent quantities as powers in T) 
one should include high enough powers of 1/L for the fit to be statistically sound, i.e., 

per degree of freedom (x^/dof) should be close to 1 for a large data set (as discussed 
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FIGURE 72. Size dependence of the ground state energy (left) and spin stiffness (right) of 2D Heisen- 
berg lattices with aspect ratio R — 1 (where — iJ) and 2 (where = 2L^). For R — 2, the spin stiffness 
on a finite lattice is different in the x and y direction. Both values are shown here along with the average. 
The curves are polynomial fits discussed in the text. 



in standard texts on data analysis). The statistical errors of the fitting parameters can 
be computed based on the error bars of the data. The safest way, which does not rely 
on any assumptions of the errors being small (although normally they should be), is to 
carry out a large number of fits with Gaussian noise added to the data (with the standard 
deviation of the noise equal to the corresponding error bar) and compute the standard 
deviation of the distribution of the resulting parameter values. Note that when increasing 
the number of fitting parameters (here powers of 1/L), the error bars of the best-fit 
parameters normally increase (and x/^of may also get worse). One should therefore not 
include many more powers than needed to bring ;^^/dof close to 1. To safeguard against 
possible remaining effects of higher-order corrections, one may nevertheless want to 
include one more parameter than strictly needed for and acceptable x^/^of. 

Ground state energy and spin stiffness. The energy per site E/N and the spin 
stiffness were evaluated using the estimators (250) and (280), respectively, with the 
energy adjusted for the constant 1/4 subtracted from each bond operator in the SSE 
simulations. As discussed in Sec. 5.2.4, the value of the spin stiffness extrapolated to the 
thermodynamic limit should be multiplied by a factor 3/2, to account for the fact that 
the spin-rotational symmetry is not broken in the SSE simulations. 

Comparing extrapolations for lattices of different shapes is a good way to check for 
detectable consequences of finite-size corrections beyond those included in the data fits 
[82, 94]. Fig. 72 shows results for both N = LxL (aspect ratio i? = 1) and = 2L x L 
{R — 2) lattices, graphed versus 1 / \/N (= 1/L for i? = 1 and 1/L for R = 2). These 
results were computed using inverse temperatures j8 as high as 32 x L for the largest 
systems (L up to 40 for R = 1 and up to 32 for R = 2). For the largest systems several 
hundred CPU hours were used. The error bars are too small to be visible in the figure. 
An example of the results produced, for the 32 x 32 system E/N = —0.6695115(8) 
and p, = 0.12606(5), based 5 x 10^ Monte Carlo sweeps. The error bars are of similar 
magnitude for the other systems as well. 
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For R = I using the systems with L > 6 and a 5th-order polynomial (without the 
linear and quadratic terms, which are predicted not to be present, as discussed above) 
gives E/N = -0.6694421(5), while a fit of the same order to the = 2 data with 
Ly>6 delivers E/N — —0.6694422(6). These results are in perfect statistical agreement 
with each other, and it is then permissible to use their statistically weighted average, 
E/N = —0.6694421 (4), as a final estimate of the ground state energy. It should be noted 
that the numerical values depend slightly on what lattices are included in the fit and the 
order of the polynomial used. Once the fit is statistically sound, these fluctuations should 
be consistent with the statistical errors. 

Turning now to the stiffness, note first that for a finite lattice with aspect ratio Ry^ I, 
there are two stiffness constants, for phase twists imposed in the x and y direction. Both 
of them are graphed for i? = 2 in the right panel of Fig. 72, along with the arithmetic 
average of the two and the R = I values. It can be seen here that the R = 1 stiffness 
and the average of the i? = 2 stiffnesses are better behaved for extrapolations than the 
individual x and y values for R = 2, and no fits are therefore included for the latter. 
Quadratic fits for i? = 1 and = 2 (using L > 6 and L^^ > 8 data) gives ps = 0. 12065(4) 
and ps{°°) = 0. 12070(6), respectively. Taking the average of these statistically consistent 
values, and including the rotational factor 3/2, gives the final estimate ps — 0.18100(5) 
for the infinite system. 

Properties at T > 0. Since the Mermin-Wagner theorem rules out magnetic order 
in the 2D Heisenberg model for any T > 0, the correlation length must diverge as 
r — )■ for the behavior to be consistent with the ordered ground state. The behavior 
is similar in the classical 2D Heisenberg model, where the correlation length diverges 
exponentially. The long-distance correlations and fluctuations of the quantum system 
can in fact formally be mapped onto a classical system with renormalized couplings 
[5], and the low-temperature regime in which such a mapping holds is referred to as 
the renormalized classical regime. Many predictions in this regime have resulted from 
field-theoretical treatments [5, 227, 84]. Remarkably detailed results have been derived 
for the temperature dependence of, e.g., the correlation length and the susceptibility. The 
predicted forms, which are believed to be asymptotically (r ^ 0) exact, depend only on 
the ground state parameters (e.g., the spin stiffness and the spin wave velocity c). 

Let us first investigate the uniform magnetic susceptibility, which should have the 
following form for T ^ (in an infinite system) [227] : 



The only model dependent parameter here is the ground state spin stiffness, which we 
determined above. To compare numerical data with this result, we first have to make 
sure that we can achieve the thermodynamic limit. For fixed temperature, we can study 
the behavior as a function of the system size, where we here use quadratic lattices with 
N — L}. The left panel of Fig. 73 shows results for several lattice sizes. For any finite 
system the susceptibility vanishes as T — > due to the singlet ground state and finite- 
size gap in the spectrum. As we discussed in Sec. 4.4.2, the lowest excitations are the 
quantum rotor states, with gaps scaling as \/N. The T /J < \/L susceptibility for a finite 
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FIGURE 73. Uniform susceptibility of the 2D Heisenberg model computed using several L x L lattices. 
The right panel shows low-T results for L = 128 and 256 on a more detailed scale (with the two data 
sets coinciding within statistical errors, indicating convergence to the thermodynamic limit). The solid 
curve in the right panel is a fit to the L — 256 data (using only the r/7 < 0.18 data points) of the form 
X = a + bT + cT^ with the parameters discussed in the text. Error bars are too small to discern but are 
typically « 2 x 10^^ for the low-temperature L = 256 results. 



system is dominated by these excitations. Once T > \/L spin waves can also be excited, 
and one may therefore suspect that a system size of at least L> l/T will be required 
for convergence to the thermodynamic limit. Here we will not study the convergence in 
detail, but just conclude based on the data in Fig. 73 that the L = 256 results are safely 
converged down ioT /J = 0.03 and can be used to test the form (287). 

Tests of analytical predictions can be carried out in different ways. We could here 
use the value of ps extracted from the finite-size extrapolations of the T = data above 
and check the agreement between the SSE results for x{T) and Eq. (73), with only 
X{0) adjusted as a fitting parameter to obtain the best agreement. Another way would 
be to also adjust p^, the best-fit value of which can be compared with the result of the 
ground state extrapolations. Here we will proceed in a different way, which tests both 
the T and corrections in (287). Fitting SSE data to a form % = ao + aiT + a2T^, we 
can use the form of the coefficients in (287) to extract corresponding estimates ps{^) 
and ps{2) for the spin stiffness from the linear and quadratic coefficients ai and a2. 
These estimates should agree with each other and with the result of the ground state 
calculation if the temperatures used in the fit are sufficiently low. At higher temperatures, 
corrections including higher powers of T will be important and should lead to a bad fit 
to the quadratic form and disagreements between the different stiffness estimates. 

The quadratic fit is good and consistent values of the exponents are obtained if only 
data for r/7 < 0.18 are included. The fit is shown in the right panel of Fig. 73, and the 
stiffness constants extracted from the parameters are pi(l) = 0.180(3) from slope and 
ps{^) = 0.179(6) from the quadratic correction. These values are in good agreement 
with (but noisier than) the ground state result ps = 0.18100(5). Fixing the form (287) 
and extracting the best-fit values of x{^) and gives a less noisy estimate [225], but 
since we are still dealing with corrections to a T = quantity it is not easy to achieve 
the same precision as we did by extrapolating the finite-size values of p^ at T = 0. 
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FIGURE 74. The temperature dependent correlation length (graped versus the inverse temperature) of 
the 2D Heisenberg model c ompute d for different lattice sizes using the spin structure factor definition 
(70), divided by the factor ^15/16 coming from Eq. (72). The dashed curve is the expected asymptotic 
(L = oo, 7" — > 0) form (289) with the parameters Pv = 0.1810 and c = 1.660 determined previously. 



The r — i- susceptibility extracted from the above fit is ao = x{^) = 0.04378(3). As 
we discussed in Sec. 4.4.2, the transverse susceptibility x± in the symmetry-broken state 
at r = is this number multiplied by 3/2, giving x± = 0.06567(4) . In combination with 
the spin stiffness, this value gives us access to another ground state parameter; the spin 
wave velocity. We discussed the spin stiffness constant as an elastic modulus in Sec. 3.5. 
This elastic-medium approach to the long-wavelength properties of quantum magnets is 
also referred to as the hydrodynamic description [229]. The transverse susceptibility is 
there analogous to a mass density, and one can relate the spin wave velocity to the spin 
stiffness and the susceptibility according to 



Using the values extracted above for ps and x± gives the velocity c = 1.6602(6). Recall 
that linear spin wave theory (Sec. 2.1.1) gives c = y/l. The difference between these 
two values is captured very well by spin wave theory including 1/5 corrections, and 
the "renormalization factor" of c can also be computed quite precisely using a series 
expansion around the Ising model [230]. 

The temperature dependence of the correlation length of the infinite system is also 
known, including a correction to the leading exponential divergence [5, 227]: 



We have already determined the two parameters involved here, the ground state spin 
stiffness and velocity, and we will use their values quoted above to directly test the 




(288) 




(289) 
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analytical prediction against SSE results based on the "second moment" structure factor 
definition (70) of the correlation length. 

Since the correlation length is a rapidly divergent quantity, very large lattices are re- 
quired to converge the calculations to the thermodynamic limit at low temperatures. The 
lattice length L should be several times larger than ^ to achieve complete convergence. 
Fig. 74 shows results for several lattices sizes up to L = 256. While larger systems have 
been studied in the past [225, 226], already these results, which are well converged for 
T /J > 0.25, show that the form (289) describes the numerical data very well. The small 
deviations have been discussed in the literature [225, 226] and can be understood as 
arising from higher-order corrections to the form (289) . 

5.3.4. Quantum phase transition in a dimerized system 

We now study the quantum phase transition in the 2D columnar dimerized Heisenberg 
model illustrated in Fig. 4(b). The phase transition of the ground state takes place as a 
function of the coupling ratio g = Jij J\ > 1 and is caused by quantum fluctuations, 
which here correspond to an increasing density of singlets on the dimers and at some 
point lead to the loss of the Neel order existing when I. We already examined some 
SSE results for the sublattice magnetization of this system in Fig. 5, and the behavior 
indicated a phase transition between a Neel-ordered and a nonmagnetic ground state 
at coupling ratio g ^ 1.9. In this section we apply the machinery of finite-size scaling 
at criticality, which we discussed in the context of classical systems in Sec. 3.3 and 
generalized to quantum systems in Sec. 3.6. Here we first analyze the critical behavior 
of several quantities calculated at sufficiently low temperatures to access the ground state 
critical behavior, and then discuss consequences of the T = quantum critical point at 
non-zero temperatures. Before that, let us spend a few words on the implementation of 
the SSE algorithm for a model with non-uniform couplings. 

SSE method. An appealing feature of the SSE algorithm for the Heisenberg model 
is that the coupling strengths only enter in the acceptance probabilities (265) and (266) 
in the diagonal updates, where the coupling J is absorbed in the inverse temperature 
jS = J /T . For non-constant couplings, we only have to define j8/, = Ji,/T for each bond 
b. In the implementation in pseudocode {27}, some scheme has to be used to identify 
the coupling corresponding to a generated bond b in an insertion attempt. In a system 
with a large number of different couplings (which could even be random, different for 
every bond) the couplings (in the form of j8^) should be stored as a list. For systems with 
just two different couplings, such as the dimerized lattice considered here, it is better 
to just order the bonds in such a way that the coupling can be quickly determined, e.g., 
with b < N/1 and b > N/2 corresponding to J2 and Ji, respectively. Note that for an 
operator removal, unlike the case of uniform couplings in code {27}, we now also need 
to extract the current bond, b = S{p) /2, in order to determine the probability. 

Finite-size scaling for 7—^0. When analyzing the ground state, we can proceed as 
in the case of the standard uniform 2D Heisenberg model in the previous section, simu- 
lating systems at sufficiently low temperatures to achieve T = convergence of all the 
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FIGURE 75. Binder cumulant (left) and spin stiffness (in the x direction) multiplied by the system 
length (right) of the dimerized Heisenberg model. The crossing points of these curves for different L tend 
toward the critical value of the coupling ratio g. Error bars are much smaller than the symbols. 



quantities of interest. This approach is discussed for various dimerized systems in, e.g., 
Refs. [85, 88] (as well as in many older works). Another approach is to study systems 
at inverse temperature j8 = L^, where z is the dynamic critical exponent (which we dis- 
cussed in Sec. 3.6) [231]. This is motivated in the following way, by a generalization of 
the finite-size scaling hypothesis (64): In a quantum system the scaling function /L) 
should be replaced by a function with two arguments, /((^ /L, ^t^/Lt:), where the correla- 
tion length in the imaginary time dimension depends on the spatial correlation length ^ 
according to ~ (which defines the dynamic exponent) and the length of the system 
in the imaginary time direction is L-r = c/T ~ /3 (where c is a velocity). If we choose 
j8 L^, then the scaling function can be written as f[^/L, /LY], which is a function 
of the single argument ^/L. Thus, the finite-size scaling procedures can be used exactly 
as in the classical systems discussed in Sec. (3.3.2). This is the case also if we take the 
limit /3 — 7- oo for each L (in practice finite /3 large enough for convergence to this limit), 
because then ^x/Lx — )■ 0, and there is again only one argument t,/L left in the scaling 
function. 

There is plenty of evidence already that z = 1 in dimerized Heisenberg models, and 
we will here use systems with /3 = L. This allows for studies of larger systems than in 
the /3 — 7> oo limit, although it is not a priori clear which approach is in the end better, 
since the corrections to the leading finite-size scaling behavior can be different. Here we 
use L up to L = 128. We will also test explicitly that systems with /3 = L exhibit behavior 
consistent with z= 1, by studying quantities which depend on z- 

We first locate the critical coupling by examining quantities that should be size 
independent at gc. Fig. 75 shows the g dependence of both the Binder cumulant and the 
spin stiffness, with the latter multiplied by L to compensate for the expected quantum 
critical scaling form Ps ^ \/L, obtained the classical form (99) with J — )■ J = 3. 

The Binder cumulant is defined according to (77), with the number of components 
n — 3. Note, however, that (77) is defined with the full scalar product = m • m in 
(75), whereas with the SSE method we here only compute the z component expectation 
values (m^) and (m^) (the off-diagonal components being more difficult to evaluate 
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FIGURE 76. Size dependent critical coupling for the dimerized Heisenberg model extracted from 
(L,2L) crossing points of the Binder cumulant, the spin stiffness constants, and the correlation lengths. 
The curves show fits to the form gc{L) = gc{°°) + a/ L"' . 

[232]). One can easily find the geometrical factors relating these by integrating the z 
component cos(0) of a classical 3D unit vector over the the angles, giving (m^) = 3 (m^) 
and (m^) = 5 (m^) . For locating the critical point, these factors play no role, and we could 
also use the plain Binder ratio defined as 7?2- = (ml) / (w^)^. 

Since the dimerized lattice does not have 90° rotational symmetry, the stiffness con- 
stants in the x and y directions are different. Although the numerical values are indeed 
quite different, their scaling behaviors close to the critical point is very similar, how- 
ever [the X stiffness is approximately a factor 2 larger — the dimers are oriented in the x 
direction as in Fig. 4(b)]. Only the x stiffness is shown in Fig. 75(b). 

Curve crossings are indeed seen in Fig. 75 for both U2 and psL, and after some 
significant drift of the crossing points (e.g., for systems of size L and 2L) for small 
L, they seem to converge to roughly the same value in both cases. Note that the crossing 
points for U2 and ps approach gc from opposite directions, which can be useful for 
bracketing the critical value [85, 88]. Crossing points can be located numerically by 
fitting a polynomial of suitable order to some of the data points, repeating the procedure 
several times with added Gaussian noise to compute error bars. Fig. 76 shows results 
of such procedures for the Binder ratio, the x and y stiffness constants, as well as 
the correlation lengths [computed using the definition (70)] in both the x and the y 
direction. Fits to the data points of the form gc{l-) = gc{°°) -\-a/L^ are also shown. 
This form describes well all the data for L > 10 (the sizes shown in the figure). All the 
extrapolated values of gc fall within the range [1 .9094, 1 .9096] , and if the L = 10 data are 
excluded the range narrows even further. The exponent ft) is in the range 2 ~ 2.5 for all 
quantities (being largest for U2)- Treating all five values obtained in these extrapolations 
as independent statistical data gives gc = 1.90948(4) as a final estimate for the critical 
point. This is in good agreement with (but with smaller error bar than) a recent estimate 
gc — 1.9096(2) obtained using T — )■ data for the same quantities on lattices with L up 
to 64. The crossing point shift exponents ft) are also in good agreement. 
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The existence of a limiting value of the location gc at which the Lps curves cross does 
not prove that z — I. One also should check that the values of Lp^ are well behaved, i.e., 
that a crossing point in the plane (g, p^L) really forms. A plot such as Fig. 76 for pgL at 
the crossing points for lattices of size L and 2L confirms that this is the case. Examining 
the Binder cumulant in a larger window of couplings, one can not find any indications of 
negative values, which would be a sign of phase coexistence at a first-order transition (as 
discussed in Sec. 3.4). Thus, the scaling behavior so far supports at continuous quantum 
critical point with z = 1. 

We could now proceed to perform data collapse fits in order to find the correlation 
length exponent V, as we discussed for classical systems in Sec. 3.3. This has been done 
for dimerized Heisenberg model, including also scaling corrections [85, 88], and the 
results are in good agreement with the expected 3D classical Heisenberg universality 
class. Here we instead just discuss the exponent 7] appearing in the critical correlation 
function (53), where we should again replace d hy d + z — 3. The staggered structure 
factor 5'(;r, n) is the spatial integral (sum on the 2D lattice) of the correlation function 
(269), while the Kubo integral (271) for the staggered susceptibility x(;r, 7c) corresponds 
to a 3D space-time integral (a 2D lattice sum and an imaginary time integral). Performing 
these integrals with the above critical form of the correlation function, with a cut-off 
equal to the system size L, gives 5(;r,;r) r^L^~^'''^ and;^(;r,;r) ~L^~''. We will test this 
behavior of the critical system and extract the exponent r\ , using the value of gc found 
above. Instead of performing new simulations at this point (which is known only to 
within a statistical error), one can perform polynomial interpolations within the existing 
data sets. One can then also easily check the behavior for points slightly off the best 
estimate of gc (plus and minus one error bar), to check the sensitivity of the fitted r\ to 
the location of the critical point. The simplest way to analyze the critical behavior is to 
fit a straight line to ln(5') and \n{x) versus ln(L) (as was done for S{7t, %) of this model 
in [88]). Some corrections to scaling are always expected, and if the data have small 
error bars a straight line can only be fitted to large lattices. With the data sets used here, 
statistically acceptable linear fits are only obtained when using system sizes L > 48. We 
will therefore also include subleading corrections and assume the following forms 

S{Tt,Tt) = aL}~'^ +bL'^, X{Tt,Tt) = aL^-'^ +bL'^, (290) 

where one would expect the subleading exponent (o to be much smaller (possibly even 
negative) than the leading exponents. One could in principle perform a combined fit with 
7] fixed to be the same for the two quantities (with (O and the constants different), but 
here we will fit the data sets independently. 

In order to more clearly see the role of the subleading correction, S{%, k) and X i^-, ^) 
are graphed in Fig. 77 with the dominant L and factors divided out. The asymptotic 
L — )■ oo behavior is then in both cases ~ L"^ , where T] is expected to be small. The cur- 
rently best estimate for the classical 3D Heisenberg universality class is rj = 0.0375(5) 
[113]. The fits to (290) give r} = 0.029(2) for 5(;r,:/r) and 77 = 0.020(3) for x{^,^)- 
In principle the forms (290) should of course work only exactly at gc, but in practice, 
for a finite range of system sizes, they fit the data well in some window around the 
true critical point. As seen in Fig. 77, the interpolated values of the two quantities at 
gc plus and minus one error bar deviate visibly from those at the midpoint, but the fits 
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FIGURE 77. Finite-size scaling of the staggered structure factor (left) and susceptibility (right) in close 
proximity of the estimated critical coupling ratio gc ~ 1 .90948(4). The powers of L corresponding to z = 1 
and Tj = have been divided out. The remaining asymptotic size dependence should then be governed by 
the actual value of rj. The fits to the forms in (290) give 77 = 0.029(3) from 5(7r,;r) and 77 = 0.020(4) 
from ;i;(;r, n). The fits were based on L > 8 data, but the resulting curves also go through the L~4- and 6 
points. The dashed curves shows the behavior at g = 1 .90948 without the scaling corrections (i.e., with 77 
and a kept at their values obtained in the fit including the corrections). 



are statistically acceptable in all three cases. The statistical errors quoted above arise 
predominantly from the uncertainty in the critical coupling. The subleading exponents 
in (290) are CO = -0.2(2) for S{7t, 7t) and CO = 0.6(2) for ;t(;r, 7t). 

Fig. 77 also shows the fitted functions with the subleading corrections disregarded 
(with the other parameters kept at their values obtained in the fit with corrections). 
Clearly the corrections are quite significant, being completely responsible for the maxi- 
mums in both curves at L ^ 10. One can of course obtain much better fits to the larger 
lattices without subleading corrections. As mentioned above, with the rather small er- 
ror bars of the data used here, only systems with L > 48 can be included in such a fit. 
Even then, there must be some influence of the neglected corrections. The values of rj 
do in fact come out somewhat lower if no corrections are included. With the corrections, 
all the data (L > 4) can be included, but to be on the safe side only L > 8 data were 
included in the fits quoted here and graphed in the figure. These fits still do pass very 
closely through the L = 4 and 6 data points, which further reinforces the quality of the 
functional form used. 

The rj values obtained here, and also in Ref. [88], are a bit lower (by a few error 
bars) than the best available classical 3D Heisenberg value [113] quoted above. Most 
likely, these small discrepancies are due to further scaling corrections, but it would still 
be good to push QMC calculations for various dimerized Heisenberg models to even 
higher precision (using larger lattices, a denser grid of lattices sizes, and also reducing 
the error bars of the computed quantities) in order to establish the agreement with the 
classical exponents for sure. This is particularly interesting and important in light of 
the fact that the staggered dimer model illustrated in Fig. 4(c) (and also some other 
dimerization patterns) seems to show small but statistically significant deviations from 
the expected 3D Heisenberg exponents [89, 233, 234]. These systems may still not be in 
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FIGURE 78. Cross-overs (indicated by thick curves) in the coupling-temperature plane of a generic 
dimerized 2D Heisenberg model. The quantum-critical point (circle) controls the behavior in the T >0 
quantum critical (QC) regime, up to some temperature where lattice effects become important (i.e., when 
1^ ^ 1). For couplings away from the critical point there is a low-temperature cross-over into either 
renormalized classical (RC) or quantum disordered behavior For a 2D system there is Neel order only 
exactly at T = 0. The thickness of the curves separating the different regimes reflect the fact that the 
cross-overs are not sharp, but can take place over significant temperature windows. 



a different universality class, as originally proposed [89], because the deviations could 
also originate from anomalously large scaling corrections [233, 234]. Regardless of the 
underlying reason for the deviations, their origin remains unclear and should be further 
examined (and such work is in progress [234]). 

Quantum critical scaling at T > 0. One of the most remarkable and important as- 
pects of quantum criticality is that the properties (the universality class) of a T = 
critical point also can strongly influence a system at T > 0, often up to rather high tem- 
peratures [84] and also if the system does not have exactly the couplings corresponding 
to a r = critical point (but is near such a point, in a sense which we will make more 
precise below). Critical fluctuations and scaling behavior can therefore be much more 
generic features of quantum systems than classical systems [142], where critical behav- 
ior often can be observed only very close to the critical point. 

In a quantum system, critical behavior can be expected when the temperature is the 
dominant energy scale. In dimerized 2D Heisenberg models, the energy scales on the 
Neel and nonmagnetic side of the transition are, respectively, the spin stiffness ps and 
the singlet-triplet excitation gap A. One can therefore expect manifestations of quantum 
criticality for T ^ and T ^ A, for ^ < gc and g> gc, respectively. In continuum field 
theories there is no upper-bound temperature for the quantum critical behavior, while 
for a lattice hamiltonian there has to be a cross-over to a different behavior above some 
temperature where the spins become essentially independent due to thermal fluctuations 
(which cannot be captured by continuum field theories, where independent discrete spins 
do not exist). The cross-over to the high-temperature limit could be defined, e.g., on the 
basis of the Curie (independent spin) susceptibility X ^ i^T) obtaining for T ^ J. 
As we will see below, the quantum critical susceptibility is completely different. 

Putting all this together, for a 2D dimerized Heisenberg model one expects a generic 
"cross-over diagram" [5, 84] of the type shown in Fig. 78. For temperatures below the 
high-r cross-over, there are three different "regimes", in which the behavior is controlled 
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be the corresponding largest energy scale in the problem; Ps, A, or T. In the renormalized 
classical (RC) regime, the spin stiffness dominates, and the correlation length diverges 
as r — > according to (289), as we saw explicitly for the standard uniform Heisenberg 
model in Sec. 5.3.3. In the quantum disordered (QD) regime, the correlation length 
saturates at some finite value as T — )> 0, with — )■ constant roughly below T^A.ln the 
quantum critical (QC) regime, the correlation length should diverge at gc when T ^ 
according to 

(^~l/r^ (291) 

This behavior can be understood based on the path integral mapping of the quantum 

system in d dimensions onto a classical (J + z)-dimensional system. There the coupling 
g of the quantum system corresponds to T of the (d + z)-dimensional system. We we can 
then think of temperature scaling in the quantum system as a kind of finite-size scaling 
in the imaginary time dimension, in which the thickness of the effective system is finite; 
L^<x l/T (i.e., we keep T constant and just change the thickness of the effective system). 
By definition of the dynamic exponent, at gc we have 

^ ~ ^y^. (292) 

If the time dimension thickness Lt: = oo, both the correlation lengths ^ and diverge at 
this point, but with finite (but infinite spatial size L) we should replace by l/T 
in scaling formulas (exactly as we do in classical finite-size scaling when L<^), whence 
(292) gives (291). 

Quantum critical scaling forms have been derived for many quantities based on these 
ideas worked out quantitatively and in great detail [84] using renormalization group 
methods and large-A^ expansions [with here being the number of components of 
SU(A^) symmetric spins, with = 2 for the physical 5=1/2 spins] within the continuum 
field theory [the (2 + 1 ) -dimensional nonlinear sigma-model] . The main point is that one 
can observe scaling, with some corrections (the size of which depends on the quantity), 
also when g ^ gc, in the QC regime illustrated in Fig. 77. Away from gc, there is a cross- 
over into either RC or QD behavior. Here we only look at two important quantities; 
in addition to the correlation length we also analyze the uniform susceptibility, which 
is perhaps the quantity for which QC behavior away from gc is the most robust. The 
predicted form is linear in T exactly at gc, with a constant shift away from gc', 

X = ^T + b, {b = Oatg = gc), (293) 

where c is the spin wave velocity. The constant a is not known exactly, but its value 
computed based on the leading terms in an l/N expansion is believed to be close to the 
actual value [84]. The linear behavior should apply strictly only when T — > 0, since there 
are higher powers of T, with unknown prefactors, which are not included in (293). 

We begin by investigating the finite-size convergence of the susceptibility at gc (using 
gc = 1.9095, within the error bars of the critical point determined above), in order to 
make sure that we can reliably obtain results reflecting the thermodynamic limit at low 
temperatures. The left panel in Fig. 79 shows the temperature dependence ofx/T (which 
is easier to look at in graphs than x itself) for several power-of-two system sizes up to 
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FIGURE 79. Susceptibility divided by temperature for the dimerized system at the critical point {g = 
1 .9095) computed using different lattice sizes. The peaks seen in the left panel for small systems is a finite- 
size feature which moves toward T = as 1/L. The solid curve in the right panel is a cubic polynomial 
fit to the L = 512 data {T/Ji < 0.20), which gives the prefactor a/c^ ^ 0.0916(1) in Eq. (293). For 
comparison, in the right panel, size-converged results for for g — 1 .9090 are also shown (the lowest curve, 
marked with +). In this temperature range, quantum critical behavior is seen also at this coupling, which 
is slightly on the nonmagnetic side of the transition (and hence x/T ^ eventually as T — ^ 0) 



L = 256. As expected, the results converge quickly at high temperatures, with finite-size 
effects entering at approximately T ~ 1/L (which can be expected on account of the 
dynamic exponent z = 1). The right panel of Fig. 79 shows the low-T results on a more 
detailed scale, including also results for L = 512. Based on this comparison of results 
for different sizes, one can conclude that the thermodynamic limit can be studied with 
L < 512 lattices down ioT /J\ = 0.03 (and probably even a bit below that). 

The right panel of Fig. 79 also shows a fit of to a cubic polynomial at low 
temperatures. The corrections to the asymptotic linear form (293), is quite prominent, 
leading to an ^ 8% increase in x/T [which can be considered as effective temperature 
dependent prefactor a/c^ in (293)] from the minimum around T /Ji ^ 0.3 to the eventual 
r — value. In principle the extrapolated T = value a/c^ = 0.0916 can be used to 
extract the spin wave velocity, but since the prefactor a is not known exactly [84], this 
estimate would likely not be very precise. Another use of the result obtained here would 
be to use it in combination with an accurate estimate of c obtained in some other way, 
which would allow a test of the approximate calculation [84] of the prefactor. This is 
beyond the scope of the discussion here, however. 

Fig. 80(a) shows the temperature dependence of x g = gc and at two values some 
distance away on either side of the critical point. There is a broad maximum at T ^ 7i, 
which corresponds to the cross-over into the eventual high-temperature Curie behavior. 
Below the maximum, these near-critical systems all exhibit an approximately T-linear 
susceptibility, in accord with the form (293) with b < and b > for g > gc and 
g < gc, respectively. At still lower temperatures, there will be cross-overs into RC or 
QD behavior, which are not seen clearly here because the temperatures are still too high. 
In the QD regime X exponentially for T below the gap A. The RC form is also linear, 
like the QC form, but the slope changes in the cross-over region. Such a cross-over can 
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FIGURE 80. Temperature dependence of the susceptibility (left) and the correlation length (right) in 
critical (g — 1.9095) and near-critical systems. The solid lines show the asymptotic quantum-critical T- 
linear behavior of x (from the T = Q value of the fit in Fig. 79) and the 1 /T-linear behavior of [following 
the form (292) with a small negative constant correction]. 



actually be seen in the standard 2D Heisenberg model, corresponding to ^ = 1, which 
may appear to be too far away from the critical point. In the left panel of Fig. 73 one can 
nevertheless see an approximately linear behavior of x in the range T /J E [0.3 ~ 0.5], 
before the RC behavior sets in at lower temperatures. The slope of X in the narrow 
window is in very good agreement with that obtained with the known spin wave velocity 
and the approximately calculated prefactor a in (79), which indicates that this behavior 
really is a manifestation of quantum critical behavior far from a quantum critical point 
[84, 225] (although the very good agreement may be to some extent fortuitous, since, as 
we concluded above, there are significant corrections to the purely linear form exactly 
at gc, at much lower temperatures than the QC window at ^ = 1). 

Fig. 80(b) shows the correlation length in the x direction (which is about 30% larger 
than the y correlation length at gc) at the same near-critical couplings as in Fig. 80(a). 
While the behavior is very linear, with a small constant correction to the asymptotic 
form (292) with z = 1, the results for the systems slightly off the critical point deviate 
significantly from linearity below T /Ji ^0.3. Being a divergent quantity for g < gc, the 
correlation length has much larger corrections to the critical form than the non-divergent 
uniform susceptibility. 



5.3.5. Neel-VBS transitions in J-Q models 

With the J-Q models introduced in Sec. 2.4.3, one can study quantum phase transi- 
tions at which not only the antiferromagnetic long-range order vanishes, but a different 
symmetry is broken in the nonmagnetic state as a VBS forms. Superficially, a VBS may 
seem rather similar to the nonmagnetic state of a dimerized Heisenberg model, because 
in both cases the system exhibits a pattern of strongly and weakly correlated nearest- 
neighbor pairs. These states are fundamentally very different, however, because in a 
"manually" dimerized Heisenberg model the hamiltonian itself breaks the translational 
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symmetry, and the strongly correlated bonds also correspond to the strongly coupled 
spin pairs. In contrast, in J-Q models, the hamiltonian obeys all the symmetries of the 
lattice, and the translational symmetry is spontaneously broken in the VBS state. The 
quantum fluctuations of this VBS, especially close to the phase transition, are therefore 
very different (and much more interesting). 

On the square lattice, which we will consider here, and with the types of interactions 
we choose, the VBS can form in four equivalent patterns; hence the broken symmetry is 
Z4. We will first discuss the standard J-Q model with four-spin couplings [17] (and the 
term J-Q model will then refer just to that particular case). In this case the Neel-VBS 
transition appears to be continuous and shows many of the hallmarks of the proposed 
[16, 235] (and still controversial [236]) deconfined quantum critical point. The VBS 
state in this case is most likely columnar, but the exact nature of the state is masked 
by an emergent U(l) symmetry in the VBS state (which in this model is always near- 
critical, with large fluctuations of the VBS order parameter). This is one of the salient 
features of the putative deconfined quantum critical point. Despite the many ways in 
which the behavior of the J-Q model agrees with the theory, there are also aspects of 
the Neel-VBS transition that remain unexaplained — although not necessarily in conflict 
with the theory, because detailed analytical calculations are very difficult [235, 237] 
and many properties of the deconfined quantum critical point in SU(2) spin systems 
are simply not known quantitatively based on the field theory proposed to describe the 
transition. We will here look at one example of strong corrections (possibly logarithmic) 
to the quantum critical scaling behavior, which had not been predicted by the theory of 
deconfined quantum critical points. 

We will also discuss some results for a different kind of multi-spin coupling, which 
leads to a staggered VBS pattern [like the dimer pattern in Fig. 4(c)]. The phase transi- 
tion in this case is clearly first-order [109]. We will discuss the reasons for the qualita- 
tively different nature of the transitions in the two models. 

SSE implementation for the J-Q model. We first discuss some implementation issues 
for studies of the J-Q model with the SSE method. The model was first studied with a 
projector QMC technique formulated in the valence bond basis (which we will briefly 
discuss in Sec. 6). It is, however, also easy to implement the SSE method for it [238], 
which was initially done using "directed loop" updates (a generalized loop algorithm, 
which we will also summarize Sec. 6). An SSE algorithm for the J-Q model can also be 
devised which is almost identical to the standard algorithm with operator-loops for the 
isotropic Heisenberg model. To see this, we can write the J-Q hamiltonian with four-spin 
interactions, Eq. (23), in terms of the diagonal and off-diagonal bond operators (255) and 
(256) used in the SSE algorithm for the Heisenberg model; 

H = -jY^{Hii,-H2b) -QY,{Hib-H2b){Hic-H2c). (294) 

[b] [be] 

As in the Heisenberg model [b] is a bond with two interacting spins [i{b) , j{b)] , while [be] 
denotes two parallel bonds [i{b)J{b);k{c)J{c)], corresponding to the singlet projector 
product SijSki in (23). We will consider these bonds arranged as in Fig. 8, with the 
summation in (294) taken over all translations on the lattice and including horizontal 
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FIGURE 81. Examples of vertices in the J-Q model. Only some of the allowed spin states of the 
vertex legs (open and solid circles) corresponding to the operators are shown. Open and solid bars 
indicate diagonal and off-diagonal bond operators, respectively. The J-vertices are identical to those in the 
Heisenberg model. The Q-terms in Eq. (294) are products of two bond operators, which when expanded 
out include all combinations of diagonal and off diagonal factors. Allowed loops pass only through one of 
the operator factors in the case of Q-vertices, as illustrated here with loop segments at all the permissible 
leg pairs. Flipping loops can lead to any combination of allowed operators and spin states. 



as well as vertical bond orientations. The scheme is, however, independent on how the 
singlet projectors are arranged, and also the generalization to an arbitrary number of 
bonds in the Q term is trivial. The only constraint is that we have to avoid sign problems, 
which we do if both 7 > and Q> [with the minus signs in (294), which corresponds 
to energetically favoring singlets on the bonds included in both the J and Q terms] . The 
absence of sign problems was discussed based on a sublattice rotation in [238], and it 
can also be demonstrated using the simple operator counting arguments used for bipartite 
Heisenberg models in Sec. 5.2. 

We now have J-vertices with four legs as well as Q-vertices with eight legs, as 
illustrated in Fig. 81. The Q-vertices can be considered as two J-vertices joined together, 
in all possible combinations of diagonal and off-diagonal parts arising from the four 
operators in the Q term of (294). It is then clear that we can proceed in the same way as 
we did for the Heisenberg model, updating the operator string and a stored state using a 
combination of diagonal and loop updates. The key is here again that the matrix elements 
are the same for all J- and Q-vertices (the values being J/2 and 2/4), which means that a 
loop update in which the type of vertex (J or Q) is not changed can always be accepted. 
In the case of the Q-vertices, the loops satisfying this constraint enter and exit at the 
same operator factor, as illustrated with loop segments in Fig. 81. 

Both J and Q diagonal operators are inserted and removed in the diagonal update. The 
simplest way to insert diagonal operators is to choose completely randomly among all 
the possible single-bond [b] and double-bond [be] instances in (294). There are each 
of horizontal and vertical bonds [b] and also each of horizontal and vertical bond pairs 
[be], for a total of 4N cases to choose from. The spins in the current state have to be 
antiparallel on the bond or bonds acted on by the chosen operator, and if that is the case 
the acceptance probability is a simple modification of Eq. (265), with either /3y = 7/ {2T) 
or Pq = Q/ (AT) replacing /3, depending on the type of operator inserted. The number 
of bonds Nb is replaced by the total number of bonds and bond pairs, i.e., 4A'^. The same 
modifications apply to the removal probability (265) as well. If the ratio 2/7 is much 
different from 1, which is the case in the VBS state and at the phase transition, it is better 
to take this ratio into account already when generating the bonds or bond pairs (but even 
the trivial random operator generation actually works very well). Note that Q is much 
larger than 7 in the parameter regime we are interested in, and it is therefore best to 
define the temperature in units of Q, i.e., setting 2 = 1 in the program. 
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FIGURE 82. Binder cumulant of the staggered magnetization of the J-Q model. The left panel shows 
results up to 7/2 = 1, where U2 approaches 1, indicating a Neel state. For smaller J/Q the cumulant 
vanishes with increasing size, as shown in greater detail in the right panel, demonstrating a nonmagnetic 
state for J /Q < 0.045 (below the crossing points, which accumulate at the critical point). The cumulant 
remains positive for all J/Q, showing that there is no phase coexistence, i.e., based on these results the 
transition is continuous. 



The loop update is identical to the one we developed for the Heisenberg model, with 
the simple extension that the linked vertex list (illustrated in Fig. 57 for the Heisenberg 
model) now has eight elements for each vertex, instead of only four in the case of the 
Heisenberg model. Although the J-vertices have only four legs, the most practical way 
is to allocate eight storage elements for all vertices, using only the first four of them for 
the J-vertices (and filling the unused ones with a negative number, so that they are never 
visited in the loop update). 

Continuous quantum phase transition in the J-Q model. The initial study of the J-Q 
model [17] established the existence of a VBS ground state for small J/Q and a Neel- 
VBS transition compatible with a z = 1 quantum critical point. These results were soon 
thereafter confirmed in SSE studies of the T > quantum critical regime [238, 239]. A 
later world line QMC study claimed evidence for a first-order transition [240], and this 
scenario was argued for also based on studies of other systems [236]. Recent calculations 
on very large lattices (up to L = 256) do not, however, find any indications of first- 
order behavior [110]. This is in violation of the "Landau" rule, according to which a 
direct transition between states breaking unrelated symmetries should be generically 
discontinuous, but it agrees with the theory of deconfined quantum critical points [16]. 
There are, however, significant unexpected scaling corrections in some quantities (which 
may have been mistaken for signs of a first-order transition in [240]). Here we will first 
discuss some of the evidence for a single continuous Neel- VBS transition. 

The Binder cumulant is a useful quantity for analyzing both continuous and first- 
order phase transitions. As we discussed in Sec. 3.4, with an example in Fig. 20, phase 
coexistence at a first-order transition leads to a negative divergent peak in the cumulant 
at the transition point. Fig. 82 shows results for the antiferromagnetic Binder cumulant 
U2 of the J-Q model, calculated with the SSE method using P — Q/T = L (motivated 
by the previous work showing that z = 1). For J/Q > 0.05, U2 increases with L and 
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FIGURE 83. Finite-size scaling of the spin (left panel) and dimer (right panel) correlation lengths of 
the J-Q model. Note that the crossing points for ^ and move with increasing lattice size in different 
directions, toward each other. 



tends toward 1 , showing that the system has long-range Neel order. For smaller coupling 
ratios the cumulant decreases to 0, with no indications of a negative peak developing. 
The crossing points are well behaved and indicate a critical point at {J/Q)c ~ 0.045. 

Next, we examine the correlation length, computed using the structure factor defi- 
nition (70) for both the spin-spin and dimer-dimer correlations. While the wave-vector 
for the spin (Neel) order parameter is (;r, n), a columnar VBS corresponds to (7r,0) or 
(0, ;r), which then is used as the reference point q = in (70). To compute the dimer 
correlations, we use the definition (25) but only include the diagonal correlations in the 
bond operator (26) i.e., 

B,,{r)=S'{r)S'{r + x). (295) 

Although the finite- size scaling properties of the correlation functions based on the 
rotational- symmetric and the z component definitions may be different [238], the corre- 
lation lengths extracted from these functions should still both exhibit finite-size scaling 
~ L (up to scaling corrections) at the critical point. 

We can use the scaled correlation lengths (^^ /L (spin) and ^d/L (dimer) to test whether 
the two order parameters vanish at the same point, or whether there could possibly be two 
different transitions. Fig. 83 shows the raw SSE data, which again was generated using 
j8 = L. The curves for different system sizes cross each other, although there is some 
drift, both horizontally and vertically, even for the largest lattices. While the crossing 
J/Q values for ^s/L and ^d/L are quite far apart for small lattices, they move toward 
each other with increasing system size, in a way consistent with a single critical point. 

Fig. 84 shows crossing points extracted from L and 2L data for several different 
quantities (those discussed above as well as the spin stiffness). It is interesting to 
compare this graph with the corresponding one for the dimerized Heisenberg model. 
Fig. 76. The convergence of the critical point is clearly much slower for the J-Q model. 
While the behavior for all quantities is almost linear in 1/L over some range of system 
sizes, for the largest systems the behavior starts to change, in a way which suggests 
an eventual convergence with some higher power of L to a common critical point 
{J/Q)c ~ 0.045. Recall that for the dimerized Heisenberg model the behavior is ~ 1/L" 
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FIGURE 84. Size dependent critical coupling estimates based on curve crossings for systems of size L 
and 2L, using several different quantities — the Binder cumulant IJ2, the size-scaled spin stiffness pjL, as 
well as the spin and dimer correlation lengths ^sjL and / L. The average of the /L and / L estimates, 
shown with solid squares, exhibits almost no size dependence. The average of these points for the four 
largest L gives the critical-point estimate [J lQ)c — 0.04498(3), which is shown with the horizontal line. 



with CO in the range 2 ~ 2.5, and this behavior applies already for rather small lattices 
(L ^ 8). Because of the slow convergence, it is difficult to estimate the critical point 
of the J-Q model precisely based on these individual crossing point estimates. There 
is, however, a remarkable feature seen in Fig. 84: The i^^/L and (^^/L crossing points 
form curves that look very symmetric, approaching the apparent asymptotic value at the 
same rate but from different sides. Therefore, the average of these two estimates exhibits 
almost no size dependence at all (as also shown in the figure). Based on these average 
values one can therefore obtain a much better estimate of the critical point than what 
might initially have been expected. The result based on the four largest-L points (which 
agree completely withn statistical errors) is {Q/J)c = 0.04498(3). 

The slow convergence of the crossing points indicates large scaling corrections in 
the underlying physical quantities. Note that the Binder cumulant crossings have the 
weakest size dependence of the quantities analyzed in Fig. 84, which is also clear 
when comparing the raw data in Figs. 82 (cumulant) and 83 (correlation lengths). 
Since the Binder cumulant involves a ratio of two similar quantities, anomalous scaling 
corrections in these may partially cancel each other. It is not at all clear why the finite- 
size corrections in the crossing points for the two correlation lengths cancel each other 
almost completely. It may indicate some hidden symmetry between the near-critical 
VBS and Neel states — perhaps some kind of duality inherent to deconfined quantum 
criticality [which is the case for the U(l) version of the theory [16], but not explicitly in 
the SU(2) variant, which is the one of relevance for the J-Q model]. Note that it is not 
just the locations of the crossing points that are symmetric, but in fact the correlation 
length curves themselves look rather symmetric in Fig. 83. By reflecting one of the 
graphs about the critical point, J /Q — {J/Q)c — ^ — [J/Q— {J/Q)c], the spin and dimer 
correlation lengths for the largest system fall almost on top of each other, i.e., even the 
scaling functions for these correlation lengths appear to be the same. 
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FIGURE 85. Uniform susceptibility x/T the J-Q model graphed on linear (left) and logarithmic 
(right) temperature scales, for three coupling ratios in the neighborhood of the critical value, {J/Q)c = 
0.04498(3). The solid curve is a fit to the form x/T = a + b\n{Q/T) + cT^ at J/Q = 0.045 (which is 
within the error bar of the critical value). The SSE calculations were carried out using sufficiently large 
lattices (up to L = 512 at the lowest temperatures) to obtain results representing the thermodynamic limit. 



Quantum critical susceptibility scaling atT > 0. At a conventional quantum critical 
point, one would expect an asymptotic linear temperature dependence of the uniform 
magnetic susceptibility, as we saw in the dimerized Heisenberg model in Fig. 79, where 
corrections in the form of higher powers of T are also visible in the ratio x/T- In the 
J-Q model there are much stronger corrections to the expected quantum critical form. 
As shown in Fig. 85, there appears to be a multiplicative logarithmic correction, i.e., the 
asymptotic T critical behavior may be of the form x ~ rin(l/r). The results are 
graphed both on linear and logarithmic temperature scales, in order to clearly convey 
how the behavior differs from that of the dimerized model in Fig. 79. The behavior in 
the J-Q model does not appear to be compatible with a conventional higher power-law 
correction, while a logarithm describes the behavior very nicely. 

Logarithmic corrections have not been suggested based on the present theory of 
deconfined quantum criticality. All analytical calculations within the non-compact CP^ 
field theory proposed to describe the transition have been performed using large-A'^ 
expansions of the SU(A^) generalization of the system (the field theory generalized to 
CP^^^ where = 2 is the physical number of spin components) [16, 16, 237]. It 
is possible that different features appear for = 2. On the other hand, Monte Carlo 
calculations of lattice versions of the field theory [241, 242, 236] have not pointed 
to any anomalous scaling corrections (although one only found first-order transitions 
[236]). Since these calculations are in disagreement with each other, the situation for the 
N = 2 theory remains unsettled. Logarithmic corrections in the susceptibility and other 
quantities do appear in related fermionic gauge field theories [243]. As we discussed 
in Sec. 4.3 and 5.3.1, in field theory language logarithmic corrections can appear as a 
consequence of marginal operators, but other explanations are also possible. In principle 
the corrections may also not really be logarithmic — they could also be due to almost 
marginal (but in the end irrelevant) operators. Corresponding power-law corrections 
with a very small exponents may not be distinguishable from logs for the system 
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FIGURE 86. Illustration of how local fluctuations of valence bond pairs can lead to a gradual change 
of a columnar state (a) into a plaquette state (d). In (b) and (c) a small number of bond pairs have been 
flipped. Local horizontal o vertical fluctuations of the maximum number of all bond pairs can take place 
with the plaquettes (bond superpositions) arranged as in (d) [one of four equivalent translated patterns]. A 
fraction p G [0, 1] of vertical bond pairs on these plaquettes corresponds to a VBS angle (p e [0, n], defined 
using the x and y dimer order parameters (296). In these figures, angles (j) € [tt, 27r] correspond to shifting 
the vertical bonds up by one step up. 



sizes accessible. While this scenario is possible for some quantities, in the case of the 
uniform susceptibility that would also be highly unusual, since normally all corrections 
to quantum critical behavior in 2D antiferromagnets are integer powers of T [84]. 

Emergent U(l) symmetry in the VBS. A prominent feature of the theory of decon- 
fined quantum criticality is that, in addition to the normal correlation length <^ (which 
can be taken as the dimer correlation length in the Neel state and the spin correlation 
length in the VBS state), there is another length scale A in the VBS phase. This is the 
distance scale on which excited 5=1/2 spinous (which are defects, a kind of vortices, 
in the VBS [245]) are confined into 5=1 "triplons" (i.e., A is the size of the spinon 
bound state). The same length scale should also be associated with an emergent U(l) 
symmetry, related to a particular kind of quantum fluctuations connecting two different 
kinds of VBS ordering patterns — the columnar state and the plaquette state. These "an- 
gular" VBS fluctuations and the two kinds of VBS states are illustrated in Fig. 86. The 
angular fluctuations occur on length scales less than A (i.e., on length scales less than 
A, the system is in a superposition of the two different VBS states). The excited spinous 
can move freely away from each other up to distance A, but at larger distances they 
become confined, due to an effective potential mediated by the VBS long-range order 
(and in this regard one can note interesting connections to confinement/deconfinement 
in certain gauge theoreis in particle physics [18]). As the critical point is approached, A 
should diverge as a power of the correlation length, A ~ i^"^, with the exponent a> I. 

The existence of an approximate U(l) symmetry in the VBS order parameter can 
be tested by examining the distribution P{Dx,Dy) of order parameters and Dy, 
corresponding to columnar order with x- and y-oriented bonds and defined as 

Z), = £S(r,-)-S(r, + x)(-l)-S = f;S(r,) ■S(r, + y)(-l)-^'-. (296) 

;=1 /=1 

For a given Monte Carlo configuration, and Dy can be evaluated at fixed imaginary 
time (propagated state in the SSE), using the improved estimator for the correlation func- 
tion discussed in Sec. 5.2.5. The improved estimator is spin-rotational invariant, which 
can best be understood by considering the equivalence between the loop formulation 
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FIGURE 87. VBS order parameter distributions P{D,r,Dy) calculated for L = 128 systems at J/Q = 
(left) and 0.045 (right). The histograms are shown in regions \Dx | , | Av | < Dcm, with Dcut chosen for the two 
coupling ratios to cover the region where the weight is significant. Brighter features correspond to higher 
probability density. The absolute scale is not important here, only the circular shapes of the distributions. 
The weigh is not distributed completely uniformly in these histograms [obeying neither perfect Z4 nor 
U(l) symmetry], due to the inefficiency of the simulations to fluctuate the VBS angle for large systems. 



and the valence bond basis [244, 31, 50] (which we briefly discuss in Sec. 6). In fact, the 
improved loop estimator for an equal time correlation function is exactly equivalent to 
an expectation value of the operator in a particular valence bond state, provided that the 
temperature is sufficiently low for the simulation only to sample the singlet sector. 

While the calculation of P{Djc, Dy) could be done with the SSE method, the results to 
be discussed below were instead generated with a ground state method in the valence 
bond basis (which in its most recent formulation [50] actually is very similar to the 
SSE method, as discussed in Sec. 6). The rotationally invariant estimator is not strictly 
needed here [109], and one can also just use the z spin components in (296), as we did 
in the correlation function Eq. (295). In the valence bond basis it is, however, natural to 
compute rotation-invariant quantities. Note that P{Dx,Dy) is not an operator expectation 
value (i.e., it is not a normal physical observable), just a distribution of individual 
measurements in a simulation carried out in a particular basis. Its symmetry properties 
nevertheless reflect the actual symmetries of the VBS order parameter of the system. 

We have already looked at a classical analogy of /'(D^, Dy) ; the distribution P{Mx,My) 
of stripe order parameters in the frustrated Ising model, with results shown in Fig. 19. 
In that case the four-fold symmetry of the order parameter at and below the transition 
temperature is obvious, while going to very high temperatures (not shown in the fig- 
ure) the distribution turns into a central peak with full rotational symmetry [i.e., U(l) 
symmetry] because of the independent Gaussian fluctuations of the short-range order in 
different parts of the system. In a similar way, in a VBS one should expect P{Dx^ Dy) 
to be U(l) symmetric in the Neel state (where there is only short-range VBS correla- 
tions), and become four-fold (Z4) symmetric inside the VBS state. As seen in Fig. 87 
(which shows recent results for larger systems than in [17, 240, 108]), this is not quite 
the case, however. Inside the VBS phase, at 7/2 = 0, the distribution is ring shaped, 
with no weight at the center and no signs of a four-fold symmetry. This shows that the 
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magnitude D, with = + Dy, of the VBS order parameter has formed, (D^) > 0, 
but its angular symmetry has not yet been established, i.e., while the magnitude of the 
VBS order parameter has relatively small fluctuations, the fluctuations in the individual 
components Dx and Dy [which define an angle ^ of the VBS in the (D^^Dy) plane] are 
very large. Close to the critical point, at 7/2 = 0.045 in the figure, the distribution is 
peaked at the center, as expected, but again there is no Z4 symmetry. 

In the continuum field theory, the operator responsible for the VBS formation is 
dangerously irrelevant [16, 245, 246], which means that although it is a perturbation 
with Z4 symmetry, which leads to a Z4 symmetry-broken VBS state, this perturbation 
does not affect the critical point. The theory of deconfined quantum critical point has 
built-in U(l) symmetry, and this is unchanged in the presence of the VBS operator. 
The J-Q model has no explicit U(l) symmetry — what is built in here is instead the Z4 
discrete rotational symmetry of the square lattice, and naively one would expect the 
VBS forming on this lattice to exhibit Z4 symmetry, exactly as the stripe order of the 
Ising model discussed in Sec. 3.4. Instead, we see a U(l) symmetric order parameter 
distribution /'(Z);c,Z)y), which in light of the theory of deconfined quantum critical points 
should be expected if the system is smaller than the deconfinement length scale A (which 
therefore has to be large, A > 128 even atJ/Q — according to the results in Fig. 87). 
If we increase the system size, we should eventually, for L > A, obeserve a four-fold 
symmetric distribution (which has been seen in some related systems [108, 247, 248]). 
A columnar state corresponds to peaks at (0, ±D) and (±Z), 0) . 

As discussed above and illustrated in Fig. 86, the emergent U(l) symmetry is a 
consequence of local fluctuations between columnar and plaquette VBS patterns [245], 
in such a way that all possible values of Dx and Dy compatible with a magnitude D 
(weakly fluctuating) are sampled equally (or almost equally — perfect U(l) symmetry 
only applies at the critical point). There is an analogy in elementary quantum mechanics; 
a particle in a slightly assymmetric double well. If the barrier between the wells is very 
high, then the particle is localized in the deeper well, but if the barrier is not very high 
(relative to the kinetic energy), then the particle fluctates between both wells and the 
wave function also has weight inside the barrier. In the VBS, the wells correspond to the 
columnar and plaquette states, at angles mt/2 (columnar) and nn/2 + n/4 (plaquette), 
with n = 0, 1, 2, 3, and fluctuations between them correspond to non-zero probability in 
the continuum of angles between these points. In the deconfinement theory, the columnar 
and plaquette VBS states are almost degenerate, becoming exactly degenerate at the 
critical point. Approaching the critical point, the barrier between them also is reduced. 
In the double-well analogy, the wells are becoming more degenerate and the barrier 
between them is reduced. There will then be significant fluctuations between columnar 
and plaquette states in finite systems (up to L ~ A), and this implies P{Dx,Dy) with 
weight at all the angles and an U(l) to Z4 cross-over of the symmetry when L > A. 
This applies also to the order parameter computed on subsystems of finite length L in an 
infinite (or very large) system (which has been studied in classical models with a ^-fold 
order parameter which exhibits U(l) to Zq symmetry cross-over [249, 250]). 

Since the Z4 symmetry is not yet clearly manifested in the J-Q model up to the largest 
lattices studied so far, the issue of the nature of the VBS remains open. However, a J-Q 
model in which the Q term consists of products of three parallel singlet projectors, the 
VBS order is more robust and a Z4 symmetric distribution corresponding to a columnar 
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state is seen clearly [108]. Simulations of SU(A'') generalizations of the standard J-Q 
model, with N — 3 and 4, also show columnar VBS ground states [108]. Most likely, 
the standard SU(2) J-Q model also has a columnar state, and there is indeed some weak 
hints of this in recent (ongoing) long simulations for L — 64. 

For system sizes L ^ A, the VBS distribution should be four-fold symmetric, and by 
investigating how the change from U(l) to Z4 symmetry takes place one can in principle 
determine the exponent a relating the deconfinement length scale A and the correlation 
length ^ . Classical examples of the cross-over phenomenon, which also inspired the 
theory [16], were studied in Refs. [249, 251, 250, 252]. A finite-size scaling method 
for a quantity sensitive to Z4 symmetry can be used to extract the exponent a [252]. In 
the standard J-Q model, such an analysis cannot yet be performed, because the cross- 
over into a Z4 symmetric order parameter is not seen clearly enough in the currently 
accessible lattices. The analysis has, however, been carried out in the J-Q model with 
columnar six-spin interactions, with the result that a 1.2. 

Other critical exponents. In addition to the dynamic exponent z = 1, other critical 
exponents have been analyzed in several QMC studies [17, 238, 239, 108, 110]. The 
most noteworthy result is that rj^ (the exponent of the critical spin-spin correlation 
function) is anomalously large, rjs ~ 0.3, which is in qualitative agreement with the 
theory of deconfined quantum critical points. A "large" rjs was argued for [16] based on 
the N = 00 value r/y = 1 in the 1 /N expansion of the CP^~^ field theory (i.e., different 
from the conventional mean-field value 7]^ = 0). The l/N corrections are difficult to 
compute, but in general support an unusually large exponent [237] (although it is not 
possible to extend the results reliably to N = 2). The behavior of the dimer-dimer 
exponent 77^/ obtained in QMC calculations for SU(A^) systems [108] seems to follow 
the scaling with N predicted in the theory [246, 253] (where 7]^ diverges withN). 

In light of the presence of significant scaling corrections, possibly logarithmic, in 
the susceptibility analyzed in Fig. 85, and also in the spin stiffness [110], an important 
question is whether the calculated exponents also are affected. This has so far not 
been apparent, because standard data collapse procedures with reasonable values of the 
subleading exponents work very well [108]. The best current estimates for 77^, based on 
both r > [238] and r = [108] QMC calculations agree well with each other. On 
the other hand, the results discussed here and in [110] put the critcal point at slightly 
higher J/Q than previously, and also the correlation length exponent V analyzed with 
a logarithmic correction included [110] is significantly smaller than other estimates 
[238, 108]. It would therefore be good to repeat the studies of all the exponent, using 
several different quantities and larger lattices (which is work in progress). 

There is also evidence for logarithmic corrections in the J-Q model from studies of its 
response to impurities. Introducing a vacancy into the system and studying the spatial 
distribution of the resulting magnetization imbalance, one can normally, at conventional 
quantum critical points in dimerized models, observe scaling with the system size in 
this distribution [254, 255]. In the J-Q model, such scaling analysis again requires the 
introduction of a multiplicative logarithmic correction [255]. 

Applications of other methods to the J-Q model. One interesting and puzzling as- 
pects of the J-Q model is that it seems very difficult to capture its properties correctly 



Computational Studies of Quantum Spin Systems 



January 18, 2011 



194 




FIGURE 88. In the J-Q3 model studied here, three singlet projectors are arranged in a staggered pattern. 
All distinct orientations (as shown) and translations of the projector products are included. 



using analytical many-body techniques, even ones that can rather accurately locate the 
Neel-VBS transition in the 2D frustrated J1-J2 Heisenberg model. A mean-field treat- 
ment starting from a columnar dimer state gives a critical J /Q very far from the QMC 
result [256]. This approach can be improved to better take into account some of the local 
fluctuations on plaquettes, which improves the value of the critical point but seems to 
results in a strongly first-order transition [256]. Cluster mean-field calculations converge 
very poorely with the cluster size [257]. These results point to unusually strong non-local 
quantum fluctuations [256], which cannot be easily captured with local approaches start- 
ing from small clusters or conventional fluctuations around a fixed dimer pattern. The 
reason for these difficulties to capture the VBS state may be the emergent U(l) sym- 
metry, which makes it difficult to obtain both the correct long-distance behavior (likely 
columnar order) as well as the strong fluctuations between columnar and plaquette order 
on shorter length scales. 

First-order transition in a staggered J-Q model. One way to test the link between 
emergent U(l) symmetry and a continuous Neel-VBS transition is to construct a model 
in which the local fluctuations responsible for rotating the VBS angle are suppressed. 
Intuition for how to accomplish this comes from the Rokhsar-Kivelson (RK) quantum 
dimer model [258, 259], which can be regarded as an effective model for an extreme 
nonmagnetic system dominated by short valence bonds (for which the internal singlet 
structure is also neglected — the bond configurations are regarded as orthogonal states). 
The RK hamiltonian on the square lattice can be written as //rk = vV — kK, where 
V is the diagonal (potential-energy) operator, which counts the number of flippable 
plaquettes [parallel bond pairs, exactly as in Fig. 86(b,c)], and K is an off-diagonal 
(kinetic) term which flips such a pair. This model has a critical point at ^ = v which 
separates a plaquette VBS state [similar to the one in Fig. 86(d)] for v < and a staggered 
VBS state [with the bond pattern exactly as in Fig. 4(c)]. While the plaquette state is 
destroyed continuously by quantum fluctuations as v — t- fc^, the staggered state (of which 
there are four symmetry-related equivalent ones) has no fluctuations, because it has no 
flippable plaquettes. The transition upon v — )■ ^+ is therefore first-order. 

This simple picture of the RK model suggests that an actual staggered VBS in a spin 
model also should have strongly suppressed local fluctuations, and therefore should 
not be associated with an emergent U(l) symmetry. Due to the suppression of local 
fluctuations (and therefore also of large-scale fluctuations), the transition between it 
and the Neel state should be first-order. The picture is not complete, however, because 
clearly there must be some fluctuations in the staggered VBS state, considering that 
a reasonable spin hamiltonian will be quite far from a dimer model and the valence 
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FIGURE 89. Evidence for a first-order transition in the saggered J-Q3 model [109]. The Binder cumu- 
lant of the sublattice magnetization (a) exhibits a negative peak, which grows with the system size, (b) 
Hysteresis effects are observed: simulations in which the coupling ratio is slowly increased and decreased 
give different properties in large systems, with the energies crossing each other at a point which can 
be taken as a size-dependent transition point. Extrapolating these points for different system sizes gives 
{Qt,/J)c — 1.1933(1), which is marked with the vertical dashed line in the left panel. All these results 
were obtained with SSE simulations at inverse temperature Q^jT =L. 



bond configurations making up the gound state should also include some fraction of 
longer bonds (in the presence of which some local fluctuations are always possible). It 
is therefore worth testing this picture in simulations of a model whose ground state can 
be tuned from a Neel antiferromagnet to a staggered VBS. Such a model can easily be 
constructed within the J-Q framework, by arranging the singlet projectors of the Q term 
in a staggered fashion, instead of within a 2 x 2 plaquette (or in columns of three or more 
projectors, as in [108]). It turns out, however, that a Q term consisting of two staggered 
projectors is not sufficient for destroying the Neel state. With three projectors, arranged 
as in Fig. 88 (and the coupling of which we call ^3), a staggered VBS is stabilized, 
however. We here briefly summarize the evidence for a first-order Neel — VBS found in 
a recent SSE study of the staggered J-Q3 model [109] . 

Some results are shown in Fig. 89. The Binder cumulant of the Neel order parameter 
exhibits a negative peak, which grows with increasing system size. This is a strong 
indication of a first-order transition, as discussed in Sec. 3.4 (and the results can be 
compared with those of the classical frustrated Ising model in Fig. 20). The data become 
very noisy for L > 12, because of the inefficiency of the simulations in transitioning 
between the two coexisting states. For L > 24 the system can get completely trapped in 
a metastable state. On the VBS side close to the transition, if the simulation happens to 
enter a Neel like configuration during the equilibration, it will stay in a metastable Neel 
state for a very long time (and vice versa on the other side of the transition). One can 
use this trapping to investigate the transition in other ways. By doing a series of runs 
at closely spaced values of Qs//, starting either far inside the VBS or Neel phase, and 
starting each new run from the last configuration of the previous run, one can follow the 
ground state into its corresponding metastable state at the "wrong" side of the transition. 
The energies computed in two such runs, with Q^jJ either increasing or decreasing. 
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are shown in Fig. 89. The two branches cross each other, which clearly shows the first- 
order nature of the transition. The level crossing is most likely an avoided one, between 
two states with the same quantum numbers, but the spacing between the levels should 
be extremely small here. This spacing should be inversely related to the tunneling time 
between the two states at the coexistence point (but note that the relationship between 
simulation time and the real time dynamics of the system is not known). 

The crossing point of the two energy branches can be taken as a finite-size definition 
of the critical coupling. It shifts with the system size approximately as 1/L^, which is 
consistent with a first-order transition with dynamic exponent z = 1 [260]. The extrap- 
olated transition point is Qj^/J — 1.1933(1). At the transition, the VBS order parameter 
is about 75% of its maximum value (in a perfect staggered valence bond state), which is 
rather large and motivates the classification of the transition as strongly first-order. 

6. SURVEY OF RELATED COMPUTATIONAL METHODS 

In these lecture notes we have discussed exact diagonalization methods and the SSE 
QMC method in some detail and also looked at some illustrative calculations with results 
for 5 = 1 /2 models. There are many issues that were left untouched, regarding methods 
as well as physics, by restricting the discussion to S = 1/2 systems, and within that class 
also to spin-isotropic systems. The exact diagonalization approach can be easily used 
for any spin model, but the rapidly growing size of the Hilbert space with S imposes 
even more severe restrictions on the lattices sizes for 5 > 1 /2 systems. The world line 
and SSE QMC methods can be generalized to any spin model, as long as there is no 
frustration leading to sign problems. In this section we briefly summarize some QMC 
schemes for more general spin models, as well as some further recent developments for 
isotropic 5=1/2 systems. 

There are several other important computational methods that go beyond the scope of 
these lecture notes, such as series expansions techniques (high-temperature expansions 
and expansions around some solvable limit of the ground state, e.g., the Ising limit of an 
anisotropic Heisenberg model) [261, 262] and the DMRG method [28, 29]. While series 
expansions can in principle be applied to any model, there are issues with extrapolating 
the series in a controllable manner, and one cannot, in most cases of interest, expect to 
reach the same level of accuracy as in QMC simulations of sign problem free models 
(e.g., in studies of quantum phase transitions). Series expansion methods are neverthe- 
less one of the most powerful classes of methods currently available for 2D frustrated 
quantum spin systems [100], where QMC methods are limited by the sign problem (but 
note recent progress in controlling the sign problem at high temperatures [193]). More 
powerful series expansion schemes are also still being actively developed [263], and one 
can expect progress to continue on this front. The DMRG method is very powerful for 
ID systems and can also be used for 2D systems of moderate size [94]. Extensions of the 
DMRG method and the related matrix-product states [26, 264, 21] to higher-dimensional 
"tensor-network" states [21, 106, 265] are currently being explored very intensely. This 
is an exciting line of research, that may eventually lead to viable schemes for unbiased 
studies of frustrated spins (and even fermionic systems). It appears unlikely, however, 
that these methods will, for the foreseeable future, be able to probe quantum spin systems 
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FIGURE 90. Examples of the four possible paths through vertices in the anisotropic 5=1/2 Heisenberg 
model (which may include also an external magnetic field) and the new vertices generated when moving 
along the paths in the directed loop algorithm. The vertical lines at the vertices to the left in each box 
indicate the entrance and exit legs (with either direction of movement being possible). Given an entrance 
leg, the exit must be chosen such that a new allowed vertex is generated when the leg spins are flipped (the 
vertices to the right, without the path indicated). In the "bounce" process (d), the exit is at the same leg as 
the entrance and the vertex is not modified. There are six allowed vertices in the generic anisotropic 
Heisenberg model — those in which the spin (z component) is conserved. In the "deterministic" loop 
algorithm discussed in Sec. 5, there are only four vertices and only paths of type (a) are allowed 



at the level of precision possible with existing QMC methods for unfrustrated systems. 

In light of the above advantages and disadvantages of different methods, the author 
advocates a two-pronged approach in research on quantum spin models and related 
systems: (i) Explore new and improved schemes (including also QMC schemes) that 
may be useful to study models beyond the reach of currently available QMC techniques, 
(ii) Study interesting physics with state-of-the-art QMC methods for sign problem free 
models. It is far from true that everything that can be done has already been done with 
sign problem free models (although one can some times hear such claims)! With the 
QMC methods available now, such as the SSE algorithm described in Sec. 5, and with 
the power of modern computers (which still is increasing at an amazing pace, e.g., with 
a steadily increasing numbers of cores per CPU), it is possible to access very interesting 
physics that was beyond reach only a few years ago. The discussion of the dimerized and 
J-Q classes of models in Sec. 5.3.5, in particular, has hopefully convinced the reader that 
there is much to explore in such systems, and beyond (e.g., random systems exhibiting 
unusual states and quantum phase transitions [266, 267, 268, 278]). 

Directed loop QMC algorithms. The directed loop algorithm is a generalization of 
the SSE operator-loop scheme, introduced in [33] (building on a previous less efficient 
formulation [190]) and applied there to S = 1/2 Heisenberg systems with interaction 
anisotropics (Ising and XY) and external magnetic field. The main difference between 
loops and directed loops (in both SSE and world line formulations) is that the path 
through the vertices is not unique in the directed loop scheme. The path (the exit leg, 
given an entrance leg to a vertex) is chosen according to certain probabilities, constructed 
to maintain detailed balance in a space with two defects — the open loop ends (in a 
way similar in spirit to the "worm" algorithm [32]). For 5=1/2 systems, there are 
four types of vertex paths, corresponding to the location of the exit relative to the 
entrance, as illustrated and discussed in more detail in Fig. 90. The path probabilities, 
which should be solutions of the corresponding directed loop equations (for detailed 
balance) are in general not unique. For some models, eliminating the "bounce" process 
in Fig. 90(d) reduces the directed loop scheme to one of the standard loop updates 
previously developed for S = 1/2 (as in the isotropic system discussed in Sec. 5) 
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and higher-5 models [191, 31, 189]. In other cases the directed loops (and worm) 
algorithms allow for efficient simulations where the standard loop algorithms are not 
applicable [33, 269, 270, 271]. Here it can also be mentioned that it is some times useful 
in SSE directed loop algorithms to work with operators defined on cells larger than 
those containing the elementary operators of the Hamiltonian (e.g., the two-spin bond 
operators of the Heisenberg model) [272]. This allows for more options in choosing how 
operators are reconfigured when vertices are traversed by the loops, which some times 
can make simulations much more efficient. 

QMC algorithms in the valence-bond basis. In Sec. 2.2 we briefly discussed the 
valence bond basis, in which a basis state is a product of two-spin singlets, as in 
Eq. (20), and in which any total singlet singlet state can be expanded. The basis is 
overcomplete and non-orthogonal, which implies that such an expansion is not unique. 
One way to work with the valence bond basis is to construct and optimize variational 
states, the simplest type of which is an amplitude product states [273]. Such a state is a 
superposition 

l^)=L^a|^a), (297) 
a 

where \Va) is a valence bond product state of the form (20) and the expansion coeffi- 
cients are products of amplitudes /j(r« ;) corresponding to the "shapes" of the bonds (the 
bond lengths in the x and y direction in the case of a 2D system), with / = 1, . . 
referring to theN/2 valence bonds in the configuration (bond tiling) labeled by a; 

N/2 

^a = n/^(ra,0- (298) 

Amplitude product states of this kind can closely reproduce the ground states of many 
bipartite Heisenberg systems, for which each singlet (a^b) — (| '\aib) ~ I ia'tb))/V2 
should be defined with site a on sublattice A and site b on sublattice B. With all positive 
expansion coefficients in (297), this convention for the singlet sign corresponds to 
Marshall's sign rule for the ground state wave function of a bipartite system [274], 
which in the standard t, \. spin basis can be written as 

sign[^«] = ^a/l^al = (-1)"^^ (299) 

where nA\. is the number of \, spins on sublattice A. 

The properties of these bipartite amplitude product states can be studied by Monte 
Carlo sampling of the valence bonds [273, 275, 50]. For the 2D Heisenberg model, 
the state with all the amplitudes h{x,y) variationally optimized is a Neel state with 
properties in very good agreement with the true ground state [275], e.g., the energy is 
within 0. 1% and the sublattice magnetization within 1% of the values obtained in QMC 
calculations. For frustrated systems, appropriate sign rules are not known (and may be 
too complicated to write down in practice). 

The idea to use amplitude product states as a starting point for projector QMC 
simulations is quite old [276, 277], but was only recently developed into a generic 
and efficient tool [47, 49, 50]. The general idea of a projector scheme is the same as 
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FIGURE 91. Example (from Ref. [50]) of a loop configuration in the valence-bond projector QMC 
method formulated in a combined space of spins (open and solid circles) and valence bonds (the arcs 
capping the loops on the left and right boundaries). The bars with four spins are diagonal and off-diagonal 
vertices with the same meaning and function as in the SSE operator-loop method (illustrated in Fig. 61). 
Loops form according to the connectivity of the vertices, and also through the valence bonds at the 
boundaries, and can be flipped without changing the configuration weight. The sum over all possible 
loop configurations corresponds exactly to the formulation of the projector scheme in the pure valence 
bond basis (i.e., without using the spins at all) [47]. Using the spins allows for a more efficient sampling 
of the configurations. Expectation values are evaluated using loop estimators at the mid-point indicated 
with a vertical dashed line, and this is analogous to the improved estimators (discussed in Sec. 5.2.5) in 
the finite-temperature versions of the loop algorithm. 



we discussed in the context of Krylov space methods in Sec. 4.2: By acting with a 
high power of the hamiltonian on an arbitrary state |^), only the component of 
that state with the largest energy eigenvalue — normally the ground state (and if not this 
can be arranged by subtracting a constant from H) — survives in the limit A — t- oo, as 
demonstrated by Eq. (170). Expectation values of the form 

where |^) is a valence bond state or superposition (e.g., an amplitude product state), can 
be sampled using IVIonte Carlo simulations. In the original formulations of this approach 
[62, 47], H is written as a sum of singlet projector operators (as in the Heisenberg 
antiferromagnet), or products of singlet operators (as in J-Q models), and strings of such 
operators are sampled (along with the valence bond configurations of |^)). The singlet 
projectors 5,j, defined in Eq. (21), lead to simple reconfigurations of bond pairs when 
acting on valence bond states. When acting on a valence bond, this operator is diagonal 
with eigenvalue 1 , 

Sab{a,b) = {a,b), (301) 

while acting on a pair of different valence bonds leads to a simple reconfiguration of 
those bonds, with matrix element 1/2; 

Sbc{a,b){c,d) = i(c,Z?)(a, J), (302) 

This can be shown easily by going back to the basis of f and | spins. Note the order of 
the indices within the singlets, which reflects consistently the convention corresponding 
to IVlarshall's sign rule discussed above; a, c are on sublattice A and b, d on sublattice B. 
These rules allow for a well-defined path integral (or SSE) like propagation of the states, 
and these propagation paths can be sampled using a IVIonte Carlo scheme. 
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In a more recent formulation of the valence bond projector method [50], the f and 
I spin basis is reintroduced, by expressing all singlets in terms of their sums over 
antiparallel spin pairs. This leads to an algorithm very similar to SSE and world line 
loop methods. Essentially, the periodic time boundaries used when simulating a system 
at a fixed temperature are cut open and replaced by two separate boundaries at which the 
valence bond states live. This is illustrated and discussed in more detail with a simple 
configuration for a four-site Heisenberg chain in Fig. 91. 

The valence bond basis (or its translation to loop methods [244, 50]) has some unique 
aspects which makes it possible to access observables that are normally out of reach 
or difficult to calculate. In an extended valence bond basis for the triplet sector, one 
can study some properties of excited states [47, 49, 278]. One can also generalize the 
valence bond basis to include one [279] or several [278] unpaired spins, which is useful 
for studies of, e.g., the magnetization distribution in systems with unequal sublattice 
occupation of the spins. Simulations in the valence bond basis have also recently found 
applications in studies of entanglement entropy [23, 280, 281, 282, 283]. One can also 
extend valence bond projector methods to SU(N) spins [108] (including even non-integer 

generalizations [248]) and other related symmetry groups [282]. 
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